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Strength Decrements from Carrying 
Various Army Packs on 
Military Marches: 


H. HARRISON CLARKE 
University of Oregon, Eugene, Oregon 


CLAYTON T. SHAY AND DONALD K. MATHEWS 
Springfield College, Springfield, Massachusetts 


Abstract 


Two independent pack-carrying studies were conducted in which the strength losses 
of involved muscles from nine 7.5-mile marches were determined. In the first series of 
marches, the rucksack had a clear advantage: not until the weight was 61 pounds were 
strength losses appreciable. Contrarily, in the second series, marching with the rucksack 
resulted in greater strength losses than with the combat pack. The difference is attributed 
to the superior physical condition of the subjects on the first rucksack march. The muscle 
groups showing greatest strength losses for the nine marches were: trunk extensors in 
eight marches; hip extensors in seven marches; and knee flexors in six marches. The 
neck extensors appeared in two of the three marches of the second series, but were not 
tested in series one. 


HIKING AND CAMPING are of increasing importance to physical education 
and recreation. If advanced instruction is to be given in these subjects, it 
should be based upon research information. This study deals with certain 
specific problems of importance in h*t»g with a pack, namely the determina- 
tion of what muscle groups are most subject to fatigue, the degree of local 
fatigue, and the relative influence of pack position and load. 

These problems are also of interest to the military services. Throughout 
history, armies have been faced with the problems of supplying soldiers with 
clothing and personal equipment, food and shelter, weapons and ammuni- 
tion. The problem is relatively simple when soldiers are in encampment; on 
the march and in combat, it is much more difficult. The expanding scale of 
military operations and changes in technology have increased greatly the 
complexity of this situation. In the solution of supply, necessarily the infan- 
tryman has been forced to transport essential and immediately needed items 
of equipment on his person. 

The foot soldier of early armies, such as the Roman legionnaire as well as 
the infantryman of France, Russia, Germany, and Japan prior to 1900, car- 
ried loads of 60 to 65 pounds and more. Frequently, these soldiers reached 
the battlefield in such a fatigued state that effective combat was impossible 


1This study was conducted at Springfield College under subsidy from the Climatic Re- 
search Laboratory, Quartermaster Corps, Department of the Army, Natick, Massachu- 
setts. Acknowledgments are expressed to the following graduate assistants who assisted 
in the collection and analyis of the necessary test data: Gavin H. Carter, David H. Clarke, 
Karl K. Klein, William Mahon, Barry Thompson, and James E. Torpey. 
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(2). As has been true of modern wars, much of the equipment thus carried 
was thrown away along the route of march. The soldier in combat frequently 
discarded everything he could do without with no particular concern as to 
what happened to it. 

Much has been written and said on the problem of what, where, and how 
much the soldier should carry. Despite all the controversy over the centuries, 
little scientific evidence is available for the solution of this problem, decisions, 
for the most part, being based upon opinion. The lack of factual data has 
been the single greatest deterrrent to the development of a completely func- 
tional system of load carrying. During recent years, the Quartermaster Corps, 
United States Army, has conducted investigations into the design of special 
packs and into ways of lightening the weight of essential items of equipment. 
Through its Climatic Research Laboratory, the effects of pack carrying on the 
human organism are being investigated under experimental conditions. 

A review of the physiological research on pack carrying has been presented 
by Bailey and McDermott (1). In the main, these studies have two charac- 
teristics: (a) The evaluative processes have been based largely on metabolic 
determiners; and to a less extent, on electromyographic techniques, center of 
gravity measurements, and the judgment of subjects. (b) The experimental 
procedures have been confined mostly to treadmill marching, thus without 
reference to the field conditions encountered by soldiers or by hikers over un- 
even terrain. These reviewers conclude that the results so far obtained are 
meager and frequently inconclusive. 

Shortcomings of the investigations reported are as follows: (a) Many stud- 
ies were based upon limited observations of a small number of subjects. (b) 
The state of training and other descriptive measures of the subjects were not 
known. (c) Psychological motivational factors in pack-carrving performance 
were ignored. (d) Energy cost studies do not adequately reflect the fatigue of 
local muscle groups. (e) Only muscles directly underlying the pick-up elec- 
trodes can be investigated by electromyography; and there is no differentia- 
tion between action and stabilizing muscles. (f) There was no reliable mea- 
sure used for determining degrees of muscular contraction. (g) The center of 
gravity studies were conducted largely from a standing rather than a march- 
ing position. 


Background of the Study 


The present investigation into the effect of marching with military packs is 
based upon two major factors, as follows: 

1. The hypothesis for the evaluative process is that muscles under physical 
stress may weaken when carrying loads. The validity of this hypothesis was 
clearly demonstrated in another study reported by the authors (4): The 
subjects fatigued their elbow flexor muscles by ergographic exercise with a 
load equal to three-eighths of the strength of these muscles; mean loss in 
strength 30 sec. after such exercise was approximately 32 per cent and 
strength recovery was not complete three-fourths of an hour later. 








Strength Decrements from Carrying Army Packs 255 


In the present study, therefore, the strength loss from marching with 
various loads carried in different positions on the body and in different army 
packs was determined for the muscles primarily involved. The strength 
decrement technique identifies muscle groups affected by localized fatigue; 
and also may indicate general body fatigue by combining the strength losses 
of all muscles so affected. 

2. The study was designed for field conditions with the actual load and 
load distribution on the body of the rifleman. Thus, it is recognized that the 
foot soldier in the field is not limited to marching on a smooth surface (as 
on a treadmill), but must often travel long distances over rough terrain; and 
does not carry the entire weight in a lump on the back (as frequently done 
in such studies), but distributes parts of the load to other areas of the body. 
These same observations are true for hikers on outdoor trails. 


Research Procedures 


FIELD CONDITIONS 

1. The field course was 1.25 miles long over moderately rough and hilly 
terrain. Each march was 7.5 miles long, requiring six repititions of the course. 

2. The rate of march was 2.5 miles in 50 min.; a ten-minute stop per- 
mitted time for re-testing the subjects during the march. A pace-control 
system was devised to insure adherence to this rate of march. 

3. The subjects were provided with regulation army clothing, combat 
boots, equipment, packs, and M-1 rifles. The subjects were dressed and the 
load distributed on the person, with the exception of the packs which were 
varied, in accordance with Army regulations. This arrangement is shown 
in Figure I, with combat pack carried low on the back. 

4. An exploratory march and six experimental marches were conducted. 
The specific conditions for the six experimental marches were as follows:? 


March No. 1: Combat pack without rifle; weight 29 lb.; pack carried high on back; 
winter clothing. 

March No. 2: Same as March No. 1, except with rifle; weight, 41 lb. 

March No. 3: Same as March No. 2, except with pack carried low on back. 

March No. 4: No pack, load, or equipment; summer fatigues. 

March No, 5: Rucksack with rifles and grenades; weight, 41 lb.; summer fatigues. 

March No. 6: Same as March No. 5, except with 61 lb. (which equals the field-load 
weight of the rifleman, but not the pack carried). 


STRENGTH TESTING 
1. Strength Tests: The muscle groups tested* were those considered to be: 


(a) under direct stress in supporting the packs, (b) involved in stabilizing the 
rest of the body in relation to the pack, (c) primarily activated in marching, 





2The weights given include al! equipment carried by the subjects, not alone the weight 
of the pack. 

8For assistance in the selection of these muscle groups, acknowledgments are made to 
Dr. Earl C. Elkins, Section on Physical Medicine, Mayo Clinic, and to Dr. F. A. Helle- 
brandt, Head, Department of Physical Medicine and Rehabilitation, University of Mli- 
nois Research and Educational Hospitals, 
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Fic. I. Military Equipment: Pack Carried Low on Back. 


and (d) affected by interference with circulation caused by pressure from 
pack straps. The following 11 muscle groups were tested for strength at the 
beginning and at the end of each 2.5 miles of march, in the order given: 
right grip, left grip, ankle plantar flexors, pectorals, knee extensors, knee 
flexors, hip flexors, hip extensors, trunk flexors, trunk extensors, and shoulder 
Aevators. With the exception of grip strength, whjch was measured with a 
manuometer, the strength tests were of the cable-tension type, previously 
developed in the Springfield Laboratory.‘ 

2. Strength Decrement Index: The Strength Decrement Index (SDI) was 
the basic score used in this research. It ie computed as follows: 


Sp _ S. 
SDI = 





xX 100 
Sp 
In which: S, = Before march strength 
S. = After march strength 


4Detailed data and graphs on this research will be found in references 5 and 6. Also, 
included in these references are descriptions of all tests and special devices used in this 
study. % 
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3. Testers: Prior to the actual collection of research data, the graduate 
students serving as research assistants were carefully trained in the strength 
testing techniques. They were required to demonstrate testing competence by 
obtaining objectivity coefficients by the test-retest method with different testers 
approximately equal to those established for the tests. The coefficients thus 
obtained ranged from .86 to .98. 


SUBJECTS 

The subjects were 30 Springfield College students majoring in physical 
education. The age range was between 18 and 26 years. The following 
median scores will further describe this group: height, 69.9 in.; weight, 167 
lbs.; Physical Fitness Index, 104; and Strength Index, 3013. 

Before the pack-carrying experiment commenced, the subjects were taken 
to the marching area in small groups and were oriented to the purpose of the 
study. Then, they were taken over the field course with packs, so as to become 
familiar with the trail and the rate of march which would be necessary. The 
strength testing sequence was explained and they went through the process 
of test administration. 


Results 
A summary of the results of the pack-carrying study appears in Table 1. 


Both mean SDI’s and t ratios above 1.50 (representing the significance be- 
tween pre- and post-march means) are given for each muscle group on each 


TABLE 1 
Summarization of Strength Decrement Indices for the Six Experimental Marches 



























































Muscle Groupe 
Be 4 f z ‘ 
38 uf ue f i £ é sf 
$1 Go | fe | ef | of | Sb] 88 | G5 
ne & i) = ed te Mm | <k 
Marches Mean ms Lt ; ie ae 
1. 29-Ib. SDI 9.31 | 11.19| 364] 634] 13.78| 5.19| 3.57 
high com- 
bat pack} t 1.72 2.00 2.29 1.74 4.20 1.53 | 1.65 
2. 41-1b. SDI 3.18} 3.70| 421! 5.04] 13.01! 5.48| 3.64 
high com- 
bat pack| 171| 250| 250] 230| 459| 3.54| 2.18 
3. 411b. | SDI | 10.28, 430| 809! 597| 4.28| 1030, 4.99| 4.49 
low com- | 
bat pack; t 5.30 157; 175) 2.16; 243, 3.13 | 2.00 | 2.19 
4. No load | SDI | 7.84! 6.40 | | | | 4.68 
t | 200} 171 | | 2.13 
5. 41-lb. | SDI | | 4.36) 3.46. 3.29 
rucksack t | | 1.60 | 1.62 | 2.14 
SSS “" —— eens ee SSS 
6. 6llb. | SDI | 1010) 808| 1464| 535| 5.75| 6.46 | | 6.58 
rucksack} t_ | 2.79| 467| 453| 240| 167| 247| | 2.86 
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march.® The most significant figures on this table are the ¢’s. The mean SDI’s 
are affected by any unusual situation, causing an extreme score, which could 
inadvertently happen during a given march. Such extreme scores increase 
means disproportionately to the importance of single cases. The t is not so 
affected, as the extreme scores also produce greater standard deviations, 
which in turn are reflected in larger standard errors for these measures. 

Eight muscle groups, all except the right and left grips and the pectorals, 
had t’s above 1.50 for at least three of the marches. The exceptions noted did 
not produce statistically significant results for any of the pack carries, so are 
eliminated from Table 1 and from further consideration in this report. The 
grip strength tests were included in this research as it was thought that the 
pressure from the pack straps in carrying military loads might reduce their 
strength as a consequence of interference with blood circulation to the arms 
and hands. However, such results were not obtained from these tests. 


Observations of the strength losses resulting from the various marches 
are as follows: 


1. Carrying a 41-lb. load with the rucksack resulted in the least over-all 
strength loss of the muscle groups tested. Only three muscle groups, all 
located in the lower body (marching muscles), had strength losses which 
were statistically significant near and beyond the .05 level of confidence: hip 
flexors, hip extensors, and ankle plantar flexors. 

2. With the rucksack, also, the effect on total strength loss with the 61-lb. 
load was approximately the same as for the combat pack with 41 lbs. carried 
low on the back. However, the proportionate amount of loss for the different 
muscle groups varied: with the rucksack, greater loss was found for the 
trunk flexors and extensors; with the combat pack, the larger decrements 
were for the shoulder elevators and the knee extensors. 


3. Very little difference in over-all strength loss was found between the 
41-lb. high and low combat pack marches, although there were differences in 
the muscles affected. The greater losses were: knee flexors and extensors for 
the high pack; and shoulder elevators for the low pack. 


4. With no load, three muscle groups were involved, the trunk flexors and 
extensors and the ankle plantar flexors. From observing the results of the 
other marches, loads up to 41 lb. did not affect the mean SDI’s of these 
muscle groups beyond those involved in marching the 7.5 miles. However, 
when the load was increased beyond this weight, definite additional stress 
was found, as shown for the 61-lb. rucksaek march. 


5Significant t’s, when probability is confined to one tail of the normal distribution, are: 
1.70 at the .05 level of confidence; and 2.47 at the .01 level. The standard error formula 
for correlated groups was used in these computations. Inasmuch as the subjects utilized 
in this study did not constitute a random sample, but, rather, were physically superior to 
the normal population, the t-ratios should be interpreted cautiously. 
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5. The knee flexor muscles had the greatest total strength loss on the six 
experimental marches. However, the strain on these muscles was centered 
primarily in the first three and last marches, with t’s of 4.20, 4.59, 3.13, and 
2.47 respectively. As was subsequently found, the amount of knee flexion 
loss for the final march, when the subjects were in their best marching 
condition, is the most significant single SDI in correlating with motor-physical 
fitness tests. The correlations were: —.53, with Strength Index; —.49, with 
AAF Physical Fitness Test; —.45, with Navy Standard Physical Fitness 
Index. In interpreting these correlations, r’s of .35 and .45 are significant 
at the .05 and .01 levels of confidence respectively. 


6. For the shoulder elevator muscles, only two marches produced statis- 
tically significant mean SDI’s, the 41-lb. low combat pack and the 61-lb. 
rucksack. In both of these marches, the load is carried low but with straps 
over the shoulders, thus resulting in a long pull on the shoulder muscles. 
The effect of this pull was not noted with the rucksack (41 lb.) until the 
weight became much greater (61 lb.), which may be accounted for by the 
brace construction of this pack providing support at the hips. 

7. The knee extensors had the least mean strength losses of the eight 
muscle groups. They followed much the same pattern, but in lesser degree, 
as did the knee flexors, except no significant mean strength loss was found 
for the 61-lb. rucksack march. This exception may be due to the better 
condition of the subjects on the final march in the series. 


8. Mean strength losses of over 10 per cent were found in seven muscle 
groups during the marches. The SDI’s were as follows: 14.64, trunk 
extensors with 6]-lb. rucksack; 13.78 and 13.01, knee flexors with 29-lb. 
and 41-lb. high combat packs respectively; 11.19, trunk extensors, with 
29-lb. high combat pack; and 10.28 and 10.10, shoulder elevators, with 
41-lb. low combat pack and 61-lb. rucksack respectively. 


Tables, graphs, and discussions of the mean SDI’s for each muscle group 
at 2.5, 5.0, and 7.5 miles of marching are contained in the original reports 
by the authors (5, 6). The presentation of so much detailed information 
is not made in this article. To show the nature of these data, however, the 
mean strength decrements of the knee flexor muscles are presented in Figure 
Il. 

Nearly the full strength decrement effect on the knee flexor muscles was 
felt in the first 2.5 miles of the first two marches, the 29-lb. and 41-lb. high 
combat packs; they continued at about the same level for the balance of the 
marches. With the 41-lb. low combat pack, the decrement line rose sharply 
at 2.5 miles (about two-thirds of the total loss), after which it continued 
as a straight line rise but at a lessened rate. The 61-lb. rucksack march 
produced an increase in strength at 2.5 miles, followed by a rapid decline 
to 7.5 miles; however, the final SDI of 6.46 is not significant at the .05 level 
of confidence. The decrements for the other two marches—no pack and 
41-lb. rucksack—were slight and not statistically significant. 
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Second Pack-Carrying Study’ 


In the preceding pack-carrying study, no effort was made to control the 
physical conditioning effect on the subjects from repeated 7.5 mile marches. 
That such conditioning took place is readily concluded from the following 
two sources of evidence: 

1, Increase in strength of the muscle groups tested: The mean pre-march 
strength scores for each muscle group tested were computed before the first 
and last marches. Thus, five 7.5-mile marches intervened between tests. The 
results appear in Table 2. 























TABLE 2 
Increase in Strength of Involved Muscles as a Result of Five 7.5-Mile Military Marches 
Means before | 
Muscle Group First March | Last March Diff. oD | t 
Trunk extensors 124.6 164.5 39.9 855 | 4.67 
NR eee 121.6 157.9 36.3 8.82 4.12 
Shoulder elevators | 452.3 524.0 717 | 2247 | 3.19 
ge RRS 155.5 170.1 14.6 5.99 | 2.44 
Ankle plantar flexors_.__ 370.8 388.9 18.1 11.98 | 151 
Hip extensors ———_..___ 111.4 113.2 18 6.26 | .29 
Knee flexors 118.2 19s | . 
Knee extensors ——.._.__. 2170 =| 2178 | 8 | 








6For this phase of the study, acknowledgments are made to the following graduate 
students who assisted in the collection and analysis of the necessary test data: Douglas A. 
Bailey, Theodore Fabyan, Frank A. Godin, Allan H. Goodyear, Robert E. Hodgdon, Karl 
K. Klein, Adrian J. Marinelli, Larry E. McClements, Frank M. Robinson, David M. Sea- 
mans, and Vincent Trotta, Jr. 
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The increase in strength of the involved muscles was definite. The differ- 
ences between the means for three muscle groups were statistically significant 
beyond the .01 level of confidence: trunk extensors (39.9 lb.), trunk flexors 
(36.3 lb.), and shoulder elevators (71.7 lb.). The gain in mean strength for 
the hip flexors (14.6 lb.) was nearly significant at the .01 level; and the 
mean gain for the ankle plantar flexors (18.1 lb.) was noticeable, but did 
not reach the .05 level. 

Furthermore, the over-all percentage in mean loss of strength was less 
and generally occurred later on the final march (7.5 miles) than on the 
first march (2.5 to 5.0 miles). This observation, however, should be inter- 
preted cautiously as the conditions were not identical, the packs being 
different for the two marches. However, the load was 20 lb. heavier for the 
final march, which might logically have caused greater strength losses. 

2. Increase in motor-physical fitness test scores: Before the exploratory 
march and after the last experimental march (seven marches thus interven- 
ing), several motor-physical fitness test batteries were administered to the 
subjects. The mean scores for this testing appear in Table 3. 


TABLE 3 
Comparison of Means on Motor-Physical Fitness Tests Before and After Seven Military 
Pack-Carrying Marches 




















Initial Final | 
Motor-Physical Fitness Tests Mean Mean Diff. | oD t 
1. Army Physical Efficiency Test 253 281 28 | 4.6 6.1 
2. AAF Physical Fitness Test 56 59 3 6 5.0 
3. Navy Standard Physical Fitness 
Aas Bek, SELENE SEER £5 260 275 yo" eae ae 
4. Physical Fitness Index... Lali 104 113 Sit Bier 
5. Strength Index _....-___ SE. 3084 3407 323. | 1420 | 23 
6. McCloy’s Arm Strength Score... 581 | 591 | 10 We 








The means for all motor fitness tests increased during the pack-carrying 
study. All increases but two were significant beyond the .01 level of confi- 
dence. The exceptions are: Strength Index with a t of 2.3, which is nearly 
significant at the .01 level; and McCloy’s Arm Strength Score with a t of 1.64, 
which nearly reaches the .05 level. All three of the armed forces motor fitness 
tests showed definite gains, especally the Army Physical Efficiency Test, which 
had a t of 6.1. The final mean of 113 on the Physical Fitness Index test is a 
high score, as 115 is the third quartile for normal populations. 


PROCEDURES 

In the above study, the physical condition of the subjects might logically 
be a factor in the strength decrement results from the pack-carrying marches. 
Inasmuch as each subject executed a set performance (marching 7.5 miles at 
a rate of 2.5 miles in 50 minutes), improved condition could be reflected in 
less strength loss. Had the performance been competitive (i.e., to march the 
course in the shortest possible time), the SDI’s could have remained much 
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the same, with improved condition being expressed in faster time. In fact, 
this latter hypothesis has been verified in a recent swimming experiment con- 
ducted at the University of Oregon (7). 

Therefore, because physical conditioning of the subjects resulting from re- 
peated marches was uncontrolled in the initial research, a second study was 
undertaken which attempted to neutralize this effect on pack-carrying evalua- 
tion. Two of the former packs were included: the combat pack carried high 
on the back and the rucksack. In addition, a Quartermaster experimental 
pack, designed to locate most of the weight around the waist, was included. 

All weights were 41 lb. As before, the marching distance was 7.5 miles and 
the rate of march was 2.5 miles in 50 minutes. The subjects were 40 Spring- 
field College students. Statistically significant t’s for this N are: 1.68 at the 
.05 level of confidence; and 2.42 at the .01 level. The strength tests utilized in 
their order of administration were: shoulder elevation, neck extension, knee 
flexion, knee extension, hip flexion, hip extension, trunk flexion, and trunk 
extension. 

The conditioning effect on the strength decrement of the muscles resulting 
from the marches was equalized by employment of the following Latin-square 
distribution, with the subjects divided into three groups of 13, 13, and 14 
subjects, designated A, B, and C: 


March Combat Pack Rucksack Experimental Pack 
First - erieeeee A B C 

_ SS EEE eet B M A 
Third... SEES «OV Cc A B 
RESULTS 


The strength decrements for the marches carrying the three military packs 
are summarized in Table 4. As in Table 1, both the SDI’s and the t’s for the 
difference between pre-march means and the means for the greatest SDI dur- 
ing the march are presented. 




















TABLE 4 
Summarization of Strength Decrement Indices for the Three Pack-Carrying Marches 
___ Combat Pack __ | Experimental Pack ____ Rucksack 
Muscle Groups _ spr | Se SE t SDI “ 
Shoulder elevators | 8.99 | 268 | 603 | 154 | 4.35 1.86 
Neck extensors | 11.25 | 419 | 3.72 | 88 7.53 3.11 
ete Gees eee OD | | RO: | OO +} CUkS 10.59 | 3.46 
Knee extensors —— —| 5.85 1.21 2.09 65 723 | 252 
Hip flexors ae 1 | Ol Bm |; OL 435 | 212 
Hip extensors _..___. | 459 | 180 | 620 | 1.97 3.88 | 239 
Trunk flexors .....|__ 2.16 .70 4.15 ee i. 788 3.81 


Trunk extensors | 640 | 2.94 a8 | 222 | «al2 | 219 





Observations of the strength losses occurring on the various marches are 
as follows: 

1. Carrying the 41-lb. load by means of the QM experimental pack re- 
sulted in the least over-all strength loss of the muscle groups tested; the total 
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was 10.96. Only two muscle groups were significantly affected in this march, 
between the .05 and the .01 levels: the trunk extensors with a t of 2.22 
(8.43% loss) ; and hip extensors with a t of 1.97 (6.2% loss). The strength 
loss for the shoulder elevators was noticeable but did not reach the .05 level 
(t of 1.54). 

2. With the combat pack carried high, five muscle groups exhibited sig- 
nificant losses in strength, four of which were beyond the .01 level of confi- 
dence. The muscle groups were: neck extensors with a t of 4.19 (11.25% 
loss) ; trunk extensors with a t of 2.94 (6.40% loss) ; shoulder elevators with 
a t of 2.68 (8.99% loss); knee flexors with a t of 2.62 (9.38% loss); and 
hip extensors with a t of 1.80 (4.59% loss). The total t was 16.74, which is 
53 per cent greater than for the experimental pack. 

3. In this series, the march with the rucksack resulted in the greatest per- 
centage of loss, approximately twice as much as for the experimental pack 
and about 25 per cent greater than for the combat pack. All eight muscle 
groups tested were significantly affected by strength losses. 

4. The hip flexors and the trunk extensors were the only muscle groups 
significantly affected on all three experimental marches, Those muscle groups 
with significant differences on two marches were: shoulder elevators, neck 
extensors, and knee flexors. 

5. The neck extensor muscle group was added to the battery of strength 
tests in this study, due to discomfort in this area experienced by the subjects 
in the earlier series. Considerable stress on these muscles was found for the 
combat pack and the rucksack, with t’s of 4.19 (11.25%) and 3.11 (7.53%) 
respectively. For the QM experimental pack, however, the strain on this 
muscle group was negligible (t of .88). 


Discussion 

Two independent pack-carrying studies were conducted in which the 
strength losses of involved muscles from 7.5-mile military marches were de- 
termined. In the first study, an exploratory and six experimental marches 
were held. During these marches, the subjects improved considerably in the 
strength of the muscles tested and in motor-physical fitness. No control of 
this factor was exercised, so that the conditioning benefits from the marches 
favored the last ones in which the rucksack was used. In the second study, 
the conditioning effect was equalized by employment of a Latin-square distri- 
bution. However, only three marches were held in this series, so the subjects 
did not achieve conditioning improvement to the same degree as in the first 
study. In fact, the condition of these subjects from marching was about at 
the level of the first experimental march (actually the second march) of the 
first series. 

In the first series of marches, the rucksack had a clear advantage in car- 
rying military loads: not until the weight was 61 lb. were strength losses ap- 
preciable. Contrarily, in the second series, marching with the rucksack re 
sulted in greater strength losses than with the combat pack, both loads being 
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41 lb. The difference is attributed to the condition of the subjects on the two 
marches as discussed above. This contention is further supported by the ob- 
servation that the SDI’s for the rucksack march with 61 lb., when the sub- 
jects were in their best marching condition, are comparable to the SDI’s for 
the rucksack with 41 lb., when the subjects were unconditioned for marching. 
For the second series of marches, the QM experimental pack was clearly 
superior, resulting in considerably less strength decrements when marching 
7.5 miles with a 41-lb. load. ° 
In the first series of marches, the shoulder elevator muscles did not show a 
significant strength loss when the combat pack was carried high on the back; 
in the second series, the loss of strength in these muscles was significant at 
the .01 level of confidence. The subjects were in about the same stage of 
marching condition. This discrepancy can be explained by the order in 
which the tests were administered after marching: in the first series, these 
muscles were tested last, thus permitting about eight minutes of recovery 
time; in the second series, they were tested first with very little rest of these 
muscles allowed. Other similar discrepancies in SDI differences between the 
two series arise from like situations. 
A listing of the muscle groups most seriously affected with strength losses 
on the various marches in the two series (t’s of 3.00 and over) is as follows: 
First Series 
29-lb. high combat pack: knee flexors (13.78%). 
41-lb. high combat pack: knee flexors (13.01%) and knee extensors (5.48%). 
41-lb. low combat pack: shoulder elevators (10.28%) and knee flexors (10.30%). 
No load: none. 
41-lb. rucksack: none. 
61-lb. rucksack: trunk flexors (8.08%) and trunk extensors (14.64%). 
Second Series 
41-lb. high combat pack; neck extensors (11.25%). 
41-lb. rucksack: trunk flexors (7.73%), knee flexors (10.59%), and neck extensors 
. (753%). 
41-lb. QM experimental pack: none 
The muscle groups showing greatest strength losses for the nine marches in 
the two series were: trunk extensors in eight marches; hip extensors in seven 
marches; and knee flexors in six marches. The ankle plantar flexors were 
included in the first series only; as they showed significant strength losses on 
all six of these marches, it is safe to classify them among the frequently af- 
fected muscles. The neck extensors appeared in two of the three marches in t 
the second series, but were not tested in series one. Of the other muscle 
groups, the trunk flexors and the hip flexors were significantly affected in five 
marches each; the shoulder elevators and knee extensors in four each. 









































Conclusions 
Observations on the strength losses resulting from the various marches with 
military packs performed in this study are as follows: 
1. Military marches with the Quartermaster experimental pack resulted in 
the least muscular fatigue, as reflected in Strength Decrement Indices of the 
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muscle groups involved. Only strength losses in two muscle groups—trunk 
extensors and hip extensors—were statistically significant. 

2. In the second series of marches, when the subjects were unconditioned 
for marching, the combat pack carried high on the back was superior to the 
rucksack. Three muscle groups were significantly affected by strength loss 
with this pack: neck extensors, trunk extensors, and shoulder elevators. 

3. When conditioned, the subjects experienced much less strength loss in 
marching with military packs, as indicated by the lower Strength Decrement 
Indices resulting from the seventh march of the first series. On this march, 
the subjects carried a rucksack with a 61-lb. load. All of the muscle groups 
tested, except the knee extensors, experienced significant strength losses from 
this march, with the trunk flexors and extensors having the greatest propor- 
tionate loss. 

4. Very little difference existed in over-all strength loss resulted between 
carrying the combat pack high or low on the back. However, the muscles 
affected differed somewhat: the shoulder elevators had greater loss with the 
low pack; and the knee flexors and extensors, with the high pack. 
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Abstract 


The strength loss effects on the muscles of the lower leg and ankle resulting from wear- 
ing three types of army boots and shoes on 7.5 mile marches carrying combat packs were 
determined. Military marches with the combat boot resulted in the least fatigue of the 
muscle groups studied; the rubber insulated boot was next favored; and strength loss 
with the quarter-cut shoe was the greatest. The muscle group most affected by propor- 
tionate strength losses for the three footwear marches was the ankle everters; and, then, 
in order; the ankle plantar flexors, the ankle inverters, and the ankle dorsi flexors. 


AMONG THE INCREASING interests of physical education and recreation 
today are hiking and camping in the out-of-doors. The long hike with pack 
over woodland trails has become an enjoyable leisure-time activity for many 
people. A problem of vital importance to personal comfort on such hikes is 
the type of footwear worn. This study deals with an evaluation of the mus- 
cular fatigue effects from wearing various boots and shoes on a standardized 
hike. 

In another study by the authors (2), the strength decrements of muscles 
primarily involved in carrying army packs on military marches were deter- 
mined. The marches were 7.5 miles in length; and the rate of march was 2.5 
miles in 50 min. The subjects were supplied with regulation army clothing, 
equipment, and rifles. The packs were: 41-lb. combat pack carried high on 
the back, the same pack carried low on the back, 41-lb. rucksack, 61-lb. ruck- 
sack, and 41-lb. experimental pack (load located primarily around the waist). 
The strength decrements of 12 muscle groups were measured: right grip, left 
grip, shoulder elevators, pectorals, neck extensors, trunk flexors and extensors, 
hip flexors and extensors, knee flexors and extensors, and ankle plantar flex- 
ors. The muscle groups under greatest stress from fatigue were identified for 
each pack; and the over-all effect was determined by combining the strength 
losses for all affected muscles. The march with the experimental pack resulted 


1This study was conducted at Springfield College under subsidy from the Climatic Re- 
search Laboratory, Quartermaster Corps, Department of the Army, Natick, Massachu- 
setts. Acknowledgments are made to the following graduate assistants who assisted in 
the collection and analysis of the necessary test data: Douglas A. Bailey, Theodore 
Fabyan, Frank A. Godin, Allan H. Goodyear, Robert E. Hodgdon, David M. Seamans, 
and Vincent Trotta, Jr, 
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in the least fatigue, as tested by strength decrements, followed in order by the 
combat pack and the rucksack. The physical condition of the subjects im- 
proved considerably as a result of repeated pack-carrying marches; and this 
factor was reflected in lessened strength losses for the late marches in the 
series. 

The present investigation was designed to compare the strength loss effects 
on the muscles of the lower leg and ankle resulting from wearing three types 
of army boots and shoes on a uniform pack-carrying march. 


Research Procedures 


The research procedures applied in this investigation are similar to those 
described for the evaluation of army packs. Inasmuch as this report is also 
published in the Research Quarterly (2), a summary of this process only will 
be given here, as follows: 


1. The following types of army footwear were included in the study: com- 
bat boot, rubber insulated boot, and quarter-cut shoe. The subjects completed 
three marches, wearing a different boot or shoe each time. The conditioning 
effect on the strength of the muscles resulting from repeated marches was 
equalized by employment of the following Latin-square distribution, with the 
subjects divided into three equal groups, designated A, B, and C: 


March Combat Boot Insulated Boot Quarter-cut Shoe 


First - scccmrpubibaiaiacamaba tee A B G 
Second " , B ¢ A 
; | Sr: Cc A B 


2. The marching distance was 7.5 miles over moderately rough terrain. 
The rate of march was 2.5 miles in 50 min.; a 10-min. stop at the end of each 
2.5 miles permitted time for strength re-testing periodically during the 
marches. The same pace-control system used in the pack-carrying study in- 
sured adherence to the rate of march. 


3. The combat pack, located high on the back, was carried on all marches. 
The total weight of the load was 41 lb., which includes the pack, rifle, ammu- 
nition, and other equipment carried by the rifleman. The subjects were 
dressed in accordance with army regulations, and the load was distributed on 
the person as it would be carried in the field. 


4. The subjects were 29 Springfield College students majoring in physical 
education. They ranged in age from 18 to 26 years. The following median 
scores will further describe this group: height, 69 in.; weight, 155 lb.; 
Physical Fitness Index, 107; Strength Index, 2900. 


5. Prior to the actual collection of test data, the graduate assistants serving 
as research assistants were carefully trained in testing techniques. They were 
required to demonstrate testing competence by obtaining objectivity coefhi- 
cients by the test-retest method with different testers approximately equal to 
those established for the tests. 
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6. The strength tests were administered each time in the same order, from 
weak to strong movements, as follows: ankle inverters, ankle everters, ankle 
dorsi flexors, and ankle plantar flexors. Clarke’s cable-tension tests (1) were 
utilized to measure the strength of these muscle groups. 

7. The Strength Decrement Index (SDI) was computed for each muscle 
group on each march at 2.5, 5,0, and 7.5 miles. This index is computed as 
follows: 

Sp ages S. 
SDI = x 100 
Sp 
In which: S, = Before march strength 
S, = After march strength 


8. In determining the statistical significance of the difference between pre- 
march and post-march means, the null hypothesis based upon “Student’s” 
distribution was used. For 29 cases, significant t’s, when probability is con- 
fined to one tail of the normal distribution, are: 1.70 at the .05 level of con- 
fidence; and 2.47 at the .01 level (3). Inasmuch as the subjects utilized in 
this study did not constitute a random sample, but, rather, were physically 
superior to the normal population, the t-ratios should be interpreted cautiously. 


Results 


ANKLE INVERTERS 

The SDI’s obtained on the marches for the ankle inverter muscles are 
shown in Figure I. Observations on these results are: 

1. The march in which the combat boot was worn had the least effect on 
strength loss of this muscle group. In fact, they increased in strength scores 
at 2.5 and 5.0 miles; and had just reached their pre-march mean at 7.5 miles. 
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Fic. I, Ankle Inversion Mean SDI’s, Footwear Experiment (Data: 1953). 
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2. With the rubber insulated boot, the subjects did not experience fatigue 
of these muscles at 2.5 miles. However, fatigué iricreased for the balance of 
the march (almost a straight-line rise) to a strength loss of 9.5 per cent. The 
difference between the pre-march mean and the mean at 7.5 miles was 3.2 lb.; 
the t was 1.97, which is significant beyond the .05 level of confidence. 

3, For the quarter-cut shoes, fatigue of the ankle inverter muscles was felt 
within the first 2.5 miles, a strength loss of 5.0 per cent. Little change was 
noted during the remainder of the march. The t for the difference between 


the pre- and post-march strength means was 2.11, indicating significance be- 
tween the .05 and .01 levels. 


ANKLE EVERTERS 


The SDI’s for the ankle everter muscles on the three footwear marches 
appear in Figure II]. Comments on these results are: 

1. Wearing the rubber insulated boot resulted in the least strength loss for 
these muscles; there was a slight increase in strength at 2.5 miles, at which 
point a steady decline followed for the balance of the march. At 7.5 miles, 
the SDI approached: but did not reach the .05 level of confidence (t of 1.62). 

2. With the combat boot, the decline in strength at 5.0 miles was signifi- 
cant at the .05 level; the t was 1.77. 

3. The quarter-cut shoe caused by far the greatest strength loss for the 
ankle everter muscles at 7.5 miles. The SDI was 11.5, with a t of 3.72, which 
is significant beyond the .01 level. 















































ne Poe 6.0 18 


Fic. Il. Ankle Eversion Mean SDI’s, Footwear Experiment. (Data: 1953). 
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ANKLE DORSI FLEXORS 


In Figure III, the SDI’s for the ankle dorsi flexor muscles are presented. 
The strength of these muscles improved on all three marches. However, only 
one increase was statistically significant: the rubber insulated boot at 5.0 miles 
(t of 2.63; significant at the .01 level). ' 
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Fic. IIL, Ankle Dorsal Flexion, Footwear Experiment (Data: 1953). 
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ANKLE PLANTAR FLEXORS 


The SDI’s obtained on the marches for the ankle plantar flexor muscles ap- 
pear in Figure 1V. Appropriate observations are: 
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Fic. IV. Ankle Plantar Flexion, Footwear Experiment (Data: 1953). 
———— == rubber insulated; —- —- —- = combat; — - -- - — = oxford 
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1. No stress on these muscles was noticeable at 2.5 miles for all three types 
of footwear. At 7.5 miles, however, significant mean strength losses were 
found with the rubber insulated boot and the quarter-cut shoe; the respective 
t’s were 1.84 and 1.89, both being significant beyond the .05 level of con- 
fidence. 

2. The loss in the strength of the ankle plantar muscles for the march in 
which the combat boot was worn was not statistically significant (t of 1.37). 
These results are not in agreement with those of the pack-carrying study, 
where strength losses of these muscles were significant for all six marches of 
the first series. However, the differences between the two series are within the 
range of chance probability. 

A general summarization of strength decrements for the three marches 
wearing different army footwear appears in Table 1. Inasmuch as mean 
strength losses of the ankle dorsi flexor muscles were not found for any of 
the marches, these data are omitted from this table. Both the mean SDI’s and 
the t’s for the differences between pre- and post-march strength means are 
given for each muscle group on each march. The most significant figures on 
this table are the t’s. The mean SDI’s are affected by any unusual situations 
resulting in extreme scores, which could happen inadvertently during any 
march; such scores increase means disproportionately to the value of single 
cases. However, these scores also increase the variability (standard devia- 
tions) of the data, which, in turn, increase the standard errors of these mea- 
sures. Statistically, then, low t’s may be found with high means, as for the 
ankle everters on the march with the quarter-cut shoes. Thus, greater con- 
fidence can be placed in the t values. 

















. TABLE 1 
Summarization of Strength Decrement Indices for the Three Footwear Marches 
Th ee ae ecru owe a ae ee 
| Ankle Ankle Plantar 
Footwear March a Mean | _ Inverters oe Everters — ry Flexors — | Totals a 
Combat boot ......_-| SDI | 000 | 539 | 9321 | 8.60 
t Se eee eee 3.14 
Rubber insulated boot SDI | 9.43 2.41 5.61 | 17.45 
t 1.97 1.62 1.84 | 5.43 
Quarter-cut shoe .___| SDI | 64 | 1164 | 455 | 2259 
t 2.11 3.72 | 1.89 | 7.72 
Total | SDI | 158 | 194 | 1337 | 4866 
t 408 | 7.11 | __ 5.10 | 16.29 





Conclusions 

Observations on the strength losses resulting from the various marches 
performed in this study are as follows: 

1. Military marches with the combat boot resulted in the least fatigue of ~ 
the muscles of the lower leg and ankle, as reflected in Strength Decrement 
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Indices of the muscle groups affected. The lowest total SDI (8.60) was ob- 
tained for this march. Only strength losses in one muscle group, the ankle 
everters, were statistically significant at the .05 level of confidence (t of 1.77). 

2. The rubber insulated boot was the next favored of the army footwear 
studied. The total strength loss for this boot was approximately 75 per cent 
greater than for the march with the combat boot. All three muscle groups 
showed strength losses which were significant at the .05 level or nearly so. 

3. Strength loss with the quarter-cut shoe was the greatest of the three 
marches. The total strength loss was nearly two and one-half times greater 
than for the combat boot march; and approximately 40 per cent greater than 
for the march with the rubber insulated boot. The strength losses for two 
muscle groups—ankle inverters and ankle plantar flexors—were significant 
beyond the .05 level of confidence; and the loss for one, the ankle everters, 
beyond the .01 level. 

4. The muscle group most affected by proportionate strength losses for 
three footwear marches was the ankle everters. Statistical significance was 
found for the different marches as follows: rubber insulated boot, nearly at 
the .05 level; combat boot, slightly beond the .05 level; and quarter-cut shoes, 
beyond the .01 level. 
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Abstract 


In this study, scores obtained from two administrations of the test of right grip given 
to college freshman physical education majors by trained and untrained testers were 
compared, 

Trained and untrained testers were equally effective in the administration of the test 
of right grip. No statistically significant differences were found between testers, between 
trials, or between the interaction of trials and testers, 


IN THE DECEMBER, 1953 issue of the Research Quarterly, Mathews (1) 
reported the results of a study which compared scores when PFI tests were 
given to non-oriented testers with scores resulting when the same subjects 
were oriented and the testers experienced. Statistically significant differences 
were found in all Physical Index texts in favor of the experienced testers, and 
the conclusions indicated the necessity for trained examiners and pre-test 
practice sessions. 

Because of the design of the experiment, obtained significant differences 
might have been the result of 1. the influence of the experience of the testers, 
2. the effect of practice or learning, or 3. a combination of interaction of both 
experience and learning. The design of the experiment does not permit a 
precise interpretation of the resultant scores. The only justifiable conclusion 
that could be made is that a significant difference between means was found. 
To conclude that the learning factor influenced the more valid scores, and 
practice sessions should precede the actual testing program may not be 
supported by further research. 

This particular design does not explain which of the above factors caused 
difference between the results. The influence or learning, or the influence of 
testers and practice should be investigated, as well as the influence of the 
training of testers. In order to further examine the effect of the training of 
testers and re-tester experience, the following related study was instigated. 


Purpose 

It is the purpose of this study to compare the scores obtained from the test 
of right grip given to college Freshman physical education majors by trained 
and untrained testers. 
Procedure 

The subjects utilized in this study were 40 Freshman physical education 
majors enrolled at Indiana University in a Freshman Major Activity Tech 
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nique Course. Fifty subjects were initially selected at random from a total 
of 63 major students, and assigned at random to ten groups. Five students 
constituted one group. The ten testers used in this study were male students 
enrolled in the graduate class, Measurement and Evaluation. 

Five of the testers selected at random were given detailed instructions con- 
cerning the administration of the test of right grip. These five individuals 
administered the test of right grip to unselected non-major Freshman stu- 
dents enrolled in the required physical education program. Using the ten- 
siometer, each of these testers administered the test of right grip a minimum 
of 35 times. During this practice period, questions concerning proper admin- 
istration were answered and a final training period was held. These five 
testers were then classified as being trained in the administration of the test 
of right grip. 

The remaining five graduate testers, none of whom had previously adminis- 
tered the test of right grip, were then read the discussion of proper adminis- 
tration of the test of right grip as given by Clarke (2, p. 158). The instru- 
ment to be used in the study was shown and the administration of the test 
with the use of the tensiometer was discussed. No further training was given 
to this group, and these five testers were designated as untrained testers. 

Each of the testers was then assigned at random to one of the ten groups 
of subjects. 

The test of right grip was then administered twice to each group by the 
assigned tester. The initial test was given on a Tuesday and the final test was 
given on Thursday of the same week. 

The instrument used was a tensiometer, model T5-6007-114 calibrated for a 
range from 0 to 200 pounds on +g in. size standard flexible aircraft cable. 
This tensiometer is guaranteed to read accurately within 5 per cent of the 
load, and with careful use closer accuracy could be obtained. 

An attachment model T579 for use with the tensiometer originally devel- 
oped by the Pacific Scientific Company in co-operation with Dr. Becktol of 
San Francisco makes it possible to test grip strength.' Figure I shows the 
tensiometer and the attachment. 

The dial of the tensiometer does not directly indicate pounds of pressure, 
but must be transformed by means of a Riser Load Table. Obtained scores 
in this study were read directly from the instrument and were not converted 
to pounds of pressure. 

Because of absences and injuries, one individual from each of five different 
groups did not take both tests. Of the five groups in which all subjects took 
both tests, one individual was dropped by random technique. This resulted 
in ten groups, four subjects to a group, for a total of 40 subjects. Five groups 
were administered the test by the designated trained testers and the remaining 
five groups by the untrained testers. 


1This instrument and attachment are available and may be obtained from the Pacific 
Scientific Company, 1430 Grande Vista Avenue, Los Angeles 23, California. 
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Bry. 4 
Fic. I. Tensiometer and attachment. 


Method of Analysis 


The experimental design used in this study is the analysis of variance ap- 
plied to combined groups of subjects, each group having two trials. This 
analysis of repeated measurements on several independent groups is described 


by Edwards (3, p. 288). 


Results 

The obtained coefficient of correlation for the test of right grip between 
the first and second trials was .849+.04. This reliability coefficient com- 
pares favorably with that reported by Clarke (2, p. 172). 


TABLE 1 
Analysis of Variance of Performance Scores of Right Grip of Two Groups of Subjects 
Tested by Trained and Untrained Testers with Two Trials for Each Group 


























| Mean | 
Source of Variation Sum of Squares | af Square | F 
Between testers 221.1125 | 1 | 221.1125 | 1.5195 
Between subjects in same | 
| peeanelegeanebers ae! RW 5529, 2750 | 38 145.5072 | 
Total between subjects 5750. 3675) | 39 
Between trials 12.0125 i 12.0125 | .9671 
Interaction trials and testers 1.5125 1 1.5125 1217 
Interaction pooled subjects | 
and trials 471.9750 38 12. 4203 
Total within subjects. ena | 40 
Grand Total 6235.8875| | 79 | 





- Table 1 contains the results of the analysis. The various sums of squares 
obtained by division of the total sum of squares, and the associated degrees 
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of freedom are presented in this table. The mean squares shown are the 
result of dividing the sums of squares by their respective degrees of freedom. 

The F test was used to find the significance of 1. the difference between 
testers, 2. the significance of trials with interaction of trials and testers, and 
3. the significance of trials and testers with the pooled interaction. 

For 1 and 38 degrees of freedom F requires a value of 4.10 at the 5-per-cent 
level for significance. The obtained variance ratios were not significant at the 
5-per-cent level. 


Conclusions 


It may be inferred that the testers (trained and untrained) were equally 
effective, since obtained differences can be accounted for in terms of random 
sampling. 

Trial means do not differ significantly, and obtained differences can be 
accounted for in terms of random sampling from a common population. 

The interaction of trials and testers does not influence the obtained results. 

These interpretations are made as a result of the above experiment utiliz- 
ing the test of right grip strength, and cannot be made in respect to other 
testing situations. 
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Abstract 


In this study Thurstone’s method of multiple factor analysis was utilized to factor the 
intercorrelations of 26 selected wartime fitness test items administered to 116 male Fresh- 
men enrolled at Indiana University. Four factors isolated were identified in terms of the 
resultant correlations between the individual test items and the factors. Isolated factors 
were identified with reservation as: I Arm Extensor Endurance, II Power of Leg Ex- 
tensors, III Power of Hip Extensors, and IV Power of Thigh Flexors. Low communalities 
of each of the test items indicated the presence of a specific factor or factors not isolated 
in the study. 


THE TERM physical fitness has many connotations but no commonly ac- 
cepted definition. This phrase has been consistently accepted throughout the 
field of physical education as one of its objectives. Lack of a clear, concise 
and acceptable definition of the term has caused many problems in the mea- 
surement of physical fitness. 


The program of physical education was utilized during World War II to 
develop and improve the physical condition of the members of the Armed 
Forces. Outstanding leaders in athletic and physical education were selected 
to plan and conduct physical fitness programs for the various branches of the 
services. National, state, and local physical fitness offices were set up to ad- 
vise and administer programs for civilians. As an outgrowth of these estab- 
lished programs, many tests of physical fitness were introduced by branches 
of the Armed Forces as well as by individuals in school and civilian pro- 
grams. Although these tests were designated as physical fitness tests, referring 
to the common interpretation of the term, they are in reality tests of motor 
fitness. 

Motor fitness is a quality which cannot be measured directly; however, it is 
measurable through the motor performance of the individual. There are many 
and varied types of motor performance, but the primary concern of wartime 
fitness is the large muscle type of movement over reasonable periods of time 
as indicated by selected wartime tests. 


One of the difficulties of measurement in the area of. fitness has been the 
lack of understanding of the degree of inter-relationships among the compo- 
nents of motor fitness. There is a need for the elimination of undesirable 
items and the establishment of an instrument to measure all common factors 
without overlapping and duplication. Selected physical tests supposedly mea- 
sure the qualities necessary for an efficient fighting man in time of war. 
What are these qualities? What factors comprise the physical condition de- 
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sired by the Armed Forces? Through the factor analysis technique, it should 
be possible to answer these questions to some extent and in so doing improve 
methods and techniques of obtaining this quality of physical condition. 

It is the purpose of this study to analyze a number of selected wartime 
physical fitness test items by Thurstone’s method of factor analysis. Through 
the factor analysis technique and partial and multiple correlation, it should 
be possible: (a) to discover the primary factors or components of the 
selected wartime physical fitness tests; (b) to discover relationships of these 
factors and the relative importance of each factor; (c) to devise a battery of 
items which would best measure common factors basic to the tests; (d) to 
determine the validity of the devised battery of tests; (e) to develop regres- 
sion equations for the prediction of each of the isolated factors; and (f) to 
develop regression equations for the best combination of all factors which 
would be considered as an index to measure all of the factors of wartime fit- 
ness tests with a high degree of validity. 


Procedure 


One hundred and seventy-four college Freshmen were used as subjects in 
this investigation, but only 116 of this number were used in the final analysis. 
All men who participated in the study were enrolled in the required physical 
education program. Records of individuals who failed to complete all tests 
were discarded. Reasons for failure to complete all tests were numerous: 
drop-outs, illness, and enlistment in the armed forces accounted for the ma- 
jority of cases discarded. The mean age of the 116 subjects was 18.5 years, 
the mean height was 5 ft. 10 in., and the mean weight 152.4 pounds. 

Even though many motivating forces were introduced, it was felt and ob- 
served that several students were not giving valid and reliable performances 
of their ability. The records of these individuals were not honored since the 
reliability of the items would be influenced by this type of behavior. 

The testing program was carried out in nine class periods throughout the 
semester with the regular program of course content being followed as closely 
as possible. Because of this method of administration, rather than consecu- 
tive administration of the tests, the “guinea pig complex” was not observed 
in the attitude displayed by students for these tests. 

The first class period was devoted to an explanation of the testing program 
and its purpose. The students were informed that the program was a part of 
the research being carried out by the department of physical education. 
Since all the subjects were already at or nearing the draft age, the per- 
formance of the tests would aid in the development of their physical condi- 
tion. Also, since this testing was included as a part of the course work, final 
marking would be influenced by performance during testing. The subjects 
were informed that, while the final mark did not depend upon how well they 
performed on the various items, it might be influenced by the completion of 
all test items and the co-operation and attitude previous to and during testing 


periods, 
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A two-minute warm-up period, consisting of three continuous exercises, 
was administered before each testing period with exercises standardized 
throughout the testing period. The exercises consisted of (a) side straddle 
hop, (b) in-place running, and (c) 10-sec. squat thrust. All test items were 
performed by subjects dressed in regulation gymnasium uniform consisting 
of shoes, socks, shorts, and shirts. 

Tests were administered with the aid of four graduate assistants majoring 
in physical education, and seven senior major assistants. All tests were ad- 
ministered on a competitive basis, which was an added stimulus for every 
individual to give his best effort. The purpose of each test was briefly ex- 
plained prior to the actual testing, and where necessary, practice in each 
event was allowed. Results of each test were interpreted to the students in 
order to gain interest and to inspire effort. 

The reliability of the test items used in this study was not obtained because 
of the large number of variables and the limitation of time for test admin- 
istration. It was imperative that all tests be administered during the first 
semester since the individuals comprising the sample would be separated by 
the second semester. 

Criteria for the selection of the test items were set up empirically by the 
writer. The criteria used for the selection of tests for this study were as 
follows: (a) They must be tests developed and used in the period of 
World War II. (b) They must be continuous test variables as apart from pass 
or fail items. (c) They must be items which could be administered to a 
large number of men using the available facilities at Indiana University. 
(d) They must be applicable to the age level of subjects. (This was included 
to eliminate fitness tests developed for elementary and junior high school 
students.) (e) They must be items which meet the commonly accepted 
criteria of validity, reliability, and objectivity. 

Analysis of the literature pertaining to fitness tests resulted in the em- 
pirical selection of test items which best fitted the established criterion. The 
31 variables selected for use in this study are as follows: 


. Sit-ups (fatigue) 
Pull-ups (fatigue) 

. 300-yard Shuttle Run 

. 60-second Squat Thrust 
Push-ups (fatigue) 
Squat Jumps (fatigue) 
. 100-yard Pick-a-back 

. 20-second Squat Thrust 
Running Broad Jump 
10. 16-pound Shot Put 

1]. 75-yard Pick-a-back 
12. Dodging Run 

13. 6-second Run 

14. Straddle Chins 

15. Vertical Jump 

16, Standing Broad Jump 
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17. Dips 

18. 250-yard Shuttle 

19. 2-minute Sit-ups 

20. 20-foot Rope Climb 

21. 100-yard Dash 
Standing Hop-step-jump 
75-yard Dash 

150-yard Dash 

220-yard Dash 

Hurdle Run 

Medicine Ball Put 

3 Standing Broad Jumps 
Leg Lifts 

300-yard Run 

200-yard Shuttle Run 


FSSENSERSER 


Physical fitness batteries used as a basis for the selection of test items were 
as follows: 


— 
. 


Army Air Force Test 

Navy Standard Physical Fitness Test 

Army Specialized Training Division Physical Efficiency Test 
Army Physical Efficiency Test 

Army Physical Efficiency Test, 1946 Revision 

Indiana University Motor Fitness Indices 

Larson’s Dynamic Strength Test 

U. S. Military Academy Physical Performance Test Batteries 
U. S. Office of Education—Achievement Standards 


SPN ANS eN 


Obtained raw scores of 31 physical tests administered to 116 male Fresh- 
men students were transformed into T-scores. T-scores of each of the 31 
items were added to form a composite score. The composite score, which 
was T-scored, was used as a criterion of physical fitness. A table of intercor- 
relations of all test variables, plus the correlations of all test variables with the 
criterion measure, was developed. International Business Machines were a 
merchanical aid in this procedure. Table 1 shows the inter-relationships of 
the test items. By inspection, this matrix revealed that several tests did not 
have high correlations with any other test. These tests: squat jumps, 20-sec. 
squat thrust, straddle chins, 20-ft. rope climb, and leg lifts were eliminated 
from the final analysis. These particular tests were rejected because the 
emergent factors are dependent upon the original correlation. Since the cor- 
relations of these tests were not high enough to change the results of the 
analysis, they were not included. 


The analysis of the 26 remaining items required the computation of four 
sets of factor loadings and four sets of factor residuals. The computation of 
the factor loadings and the factor residuals were considered complete when 
no residual was found to be significant at the 1-per-cent level of confidence. 
The unrotated factor loadings are given in Table 2. 
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TABLE 2 
Unrotated Factor Loadings 


























Item Ky Ks Ks K, hk? 

| Sl Sans s 
Sit-ups | 4344 | —.2422 | —.3356 | 2076 4031 
Pull-ups ___ sated te 4543 —.1109 | —.2172 | —.0581 2693 
300-yd. Shuttle = |= .3705 1358 | —.2557 | 1909 .2575 
60-sec, Squat Thrust..._.| 3437 | ~—.3124 | ~.0255 | 1101 2283 
Push-ups | 4871 | —.3706 | —.2208 | —.3030 5152 

| | 
100-yd. Pick-a-back —| 3819 | —.2756 | 4222 | —.2661 4708 
Running Broad Jump... 6913 .1761 1274 —.0713 5303 
16-lb. Shot Put... 4-3 1953 | —.1634 | —3026 5982 
75-yd. Pick-a-back | 4454 | —.2334 | 3957 | —.2049 4515 
Dodging Run...) «Ss .4608 «| ~~ .2620 | —.1724 | —.0841 3177 
6-sec, Run | 24971 | 1793 | —.0118 | 2361 3351 
Vertical Jump dtl jell 6406 0735 | .1923 | 0156 4531 
Standing Broad Jump. — 6399 0465 .1993 | 0142 4516 
Dips | 5234..| —.2229.| —.1595 | —.3176 4499 
250-yd, Shuttle | 3463 | = 2988 | —.1566 | 1143 2468 
2-min. Sit-ups | 4062 -|  —.3545 -| —.2456 2350 4062 
100-yd. Dash | = .4472 —.2747 | ~~ 2713 0919 3575 
Standing ae -step-j jump 4923 3627 | 0917 | —.0615 3862 
75-yd. Dash - 7094 | —.0910 |} .2657 1673 |: 6100 
150-yd. Dash | -7116 | 1309 | 0762 0856 5366 
220-74. Desh 6821 |.  .2853 0511 .1895 5853 
Hurdle Run — ol 4739 | —.0621 | —.1447 .1697 .2782 
Medicine Ball Put... 6061 2918 | —.0730 —.2518 5213 
3 Standing Broad Jumps. 6863 | 0741 | 1984 0226 5163 
300-yd. Run 2 6097 | 1498 | — .1635 1102 4329 
200-yd. Shuttle 6340 | 2920 | —.2599 0969 5642 











= Isolated Factor. 


The next step in the analysis was the rotation of the factor loadings is 
make possible a meaningful interpretation of the factors. Rotated factor load- 
ings are given in Table 3. The communalities of each test are shown by the 
symbol h,?. Inspection of Table 3 shows the negative correlations to be at 
a minimum. It shows no negative correlation is significant at the 1-per-cent 
level of confidence. The communalities of the majority of tests are compar- 
able, and all tests show a relatively high uniqueness. 


Isolated factors were then identified, and in order that the results of the 
study could be made meaningful and useful, equations were developed which 
could be used as predictive indices for the various isolated factors, as well 
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TABLE 3 
Rotated Factor Loadings 





ci mn | mw | ow | oe | 
i | | 
EE | 2853 1064 | 1515 | 5362 | 4032 
Pull-ups Be wah ae 2179 .1736 2472 | 2693 
300-7d, Shuttle | 1078 3472 —.0235 3535 | 2576 
60-sec. Squat Thrust | 1648 —.0063 3365 2966 | .2284 
PRD acho | 6221 0452 3207 | 1524 | 5151 
| 

100-yd. Pick-a-back | 41663 | 0466 6305 | —.2082 | .4707 
Running Broad Jump | 1693 | ~~ 5900 3877 | —.0565 |  .5303 
16-lb. Shot Put. | 4726 5867 1740 | —0183 | 5982 
75-yd. Pick-a-back | —.1250 .1203 6283 | 1632 | 4515 
Dodging Run = 2365 5015 0184 | 0998 | 3177 
Sa ae 1951 3657 | 3949 | 3351 
Vertical Jump | «= «0890 4796 | 4527 | 1011 | 4531 
Standing Broad Jump 0944 4588 4704 1041 | ~~ .4515 
Dips - bit a 5631 1798 3068 0797 | 4499 
250-yd. Shuttle 0417 4533 —.0550 1913 | 2468 
2-min. Sit-ups - = .2418 .0036 2504 5338 | 4061 
100-yd. Dash ____ te 0905 5693 1549 3575 
Standing Hop- -step- jump ‘ 0513 5979 1572 —.0365 3862 
75-yd. Dash __ oa 0235 4018 6225 2462 6101 
150-yd. Dash ___ 1151 5695 3889 2188 5367 
220-yd. Dash - | —,0003 6657 .2780 .2547 5853 
Hurdle Run _. al .1594 .2675 .2072 3719 .2782 
Medicine Ball Put. —.0015 6202 .1508 3375 5213 
3 Standing Broad Jumps 0972 5102 4824 1179 5164 
300-yd. Run __ 0064 5162 .3780 .1540 4331 
200-yd. Shuttle - - .2179 6388 0452 3265 5642 




















K = Isolated Factor. 


as for the combination of all factors. The rotated factor loadings and the 
correlation matrix were used as a basis for the developed equations. The 
Doolittle method of multiple correlation was employed in all instances for 
the purpose of obtaining the Beta weights, or relative weightings, of each 
item used in a particular index. 


Discussion and Interpretation 


Factor number one is identified as arm and shoulder girdle strength and 
co-ordination or arm extensor endurance because of the relatively high cor- 
relation with push-ups and dips. Significant correlations with pull-ups, 16-lb. 
shot put, sit-ups, and two-min. sit-ups are also shown. Pull-ups and the 16-lb. 
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shot put are generally considered to be tests in which this factor is present. 
The fact that the hands were clasped behind the head during the performance 
of sit-ups and two-minute sit-ups might lead to the conclusion that sit-up 
tests performed in this manner are measuring arm and shoulder girdle 
strength and co-ordination to some extent. Push-ups have the highest cor- 
relation (.6221) with this factor. 

Identification of this factor is also possible through the zero correlation 
of certain test items with this factor. No significant correlations are shown 
with any of the running events. The power events such as the vertical jump, 
standing broad jump, and standing hop-step-jump all show insignificant 
relationships with this isolated factor. This would tend to support the identifi- 
cation of this factor since the majority of the test items which show a zero 
or insignificant relationship to the factor, are items which are commonly 
accepted as not having the strength and co-ordination of the arm and shoulder 
girdle muscles as a primary factor in the tests themselves. 


Factor two is identified as strength or power of leg extensors because of 
the correlations with tests that are known to include strength as a factor. 
The relatively high correlations with events which have been designated as 
power items substantiates this identification. McCloy states that power is 
equal to force times velocity. If we interpret force to mean strength, and 
velocity to mean speed, the correlations of the power events show significant 
correlations with the strength factor and also significant correlations with 
factor three. The highest correlation with this factor is the 220-yd. dash. 
It is interesting to note that as the distance of the dash events is increased, 
excluding the 100-yd. dash, the correlation with the factor is increased. 

Several of the resultant correlations of the various test items with this 
factor of leg extensor power support the identification of this factor. Sit-ups 
and two-min. sit-ups do not show significant relationship to the isolated factor, 
nor does the 60-sec. squat thrust which is generally considered a measure of 
agility. There is a rather large difference shown between the correlation of 
pull-ups with the factor when compared to the existant correlation of push-ups 
and this variable; however, the large correlation is not significant, and these 
tests appear to the primary measures of factor one. 

Factor three is identified with reservation as a velocity factor because 
of the relatively high correlations with test variables in which speed is known 
to be a factor. Identification as power of hip extensors is plausible since 
known power items also show significant correlations with this factor. The 
100-yd. pick-a-back has the highest correlation (.6305) with this factor; 
comparable correlations also exist between the 75-yd. pick-a-back (.6283), 
the 75-yd. dash (.6225), and this third factor. 

It is interesting to note that the shuttle runs all show a zero correlation 
with this factor. It would seem that velocity should be a factor in those 
events; however, the relationships of the shuttle runs appear to be more 
highly related to factor two and also to the fourth isolated factor. 
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Significant relationships with the majority of test variables are shown with 
this factor. This appears plausible since the majority of tests have speed com- 
monly accepted as a part of the particular test items. 


Several zero or low relationships of test items (dodging run and 60-sec. 
squat thrust) are shown with this factor. These discrepancies are not mean- 
ingful and any attempt to rationalize the results would be invalid. 


Factor four is identified as power of thigh flexors with the identification 
based on the correlations of the various tests of this factor. The highest 
correlations with this thigh flexor strength factor are sit-ups to fatigue 
(.5362) and two-min. sit-ups (.5338), and many variables which are known 
to have common use of the thigh flexor muscles also have significant correla- 
tions with this factor. 


It would appear that this factor could be identified as abdominal strength; 
however, the literature appears to support the contention that sit-ups are more 
a measure of thigh or hip flexors than of abdominals. The Psoas Major, the 
Psoas Minor, the Iliacus, the Pectineus, and the Sartorius are the principle 
flexors of the hip or thigh. The abdominal muscles do come into action to 
some extent in the correct performance of sit-ups; however, they are not 
the principal muscles involved. If strengthening the abdominals is to be 
one of the major considerations of military tests, some other test must be em- 
ployed which will more adequately measure abdominal strength. 


Several factors are included in the battery of the 26 variables analyzed 
which are ordinarily thought of as tests of agility. Agility has been defined 
by McCloy as the ability to change very rapidly the direction of the body 
or parts of the body in space. The dodging run has a relatively high (.5015) 
correlation with the leg extensor factor, and no correlation with the factor of 
hip extensors. The relationship of this test variable to factor two seems 
plausible, since in this event the performer must stop and change direction 
quickly. The general strength of the individual would be a contributing asset 
for success in this event. It appears that there should be a significant correla- 
tion with factor three, but the writer has no explanation for the zero relation- 
ship. 

The hurdle run, supposedly a measure of agility and a test similar in nature 
to the dodging run, is not subject, in this study, to the same interpretation. 
The correlations of this test and the factors show an insignificant relationship 
to factor one, but significant correlations with the remaining three factors. 
The highest correlation (.3719) of this test was with factor four. 

The 60-sec. squat thrust or Burpee showed no significant relationship to 
either factor one or factor two. The highest correlation (.3365) was with the 
factor three. The communalities of these three tests of agility are relatively 
low, although significant. It appears that further analyses of tests of agility 
are necessary in order to discover a measure of the trait as defined previously. 

Speed tests, as measured by the dashes and runs, were included in the 
analysis. As expected, the correlations of these tests with the factor three are 
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all significant. It is well, perhaps, to emphasize that for the most part as the 
length of the dash or run increases, the relation to the factor three decreases; 
and an increase in relation to the factor two is evidenced. 


The 75- and 100-yd. pick-a-backs are commonly thought of as tests of 
speed and strength; however, the correlations of these events with factor two 
are very low. The only significant correlations of these tests to a factor are 
the relationships to factor three. The correlation of the 75-yd. pick-a-back 
is .6283, and that of the 100-yard pick-a-back is .6305, when related to the 
power of hip extensor factor. Thus, it is quite probable that the pick-a-back 
tests are measures of hip extensors more than measures of power of leg ex- 
tensors, 


The results of the 100-yd. dash are not in harmony with the interpretations 
of the 75-, 150-, and 220-yd. dashes with reference to the power of leg ex- 
tensors factor. Although the 100-yd. dash has a high correlation (.5693) with 
the power of hip extensor factor, it shows no significant relationship to any 
of the three additional isolated factors. This conclusion is difficult to explain. 
It would seem that the resultant correlations would agree somewhat with the 
results of the other speed or dash events; however, this is not the case, and 
no explanation is presented. 

The 6-sec. run shows a significant correlation with the factor three, but 
the highest relationship of this test is with factor four. These correlations are 
not appreciable, and since the communality is only .3351 it appears that 
there may be a specific factor in this test not common to any of the other test 
items. 

The shuttle runs included in this study all show significant correlations 
with the factor two and the thigh flexor factor. It is commonly thought that 
these events would also correlate highly with the factor three; however, no 
correlation is evidenced in this study. These tests require the ability to use 
the large muscle groups of the legs to stop quickly and change direction. 
Speed between the turning points does not appear to be an important ability 
in these tests as does the ability to stop quickly and reverse direction of 
movement. 

When the shuttle was removed from the 300-yd. event (shown by the 
300-yd. run), the correlation with the factor two was increased. It is interest- 

. ing to note that the correlation with the factor three was also appreciably 
increased. 

Power was measured by a series of tests commonly accepted as meeting 
the force times velocity principle. It is evident that all of these tests—running 
broad jump, standing broad jump, vertical jump, standing hop-step-jump, 
and three standing broad jumps—are in close agreement. All show similar 
correlations with the factor two, as well as with the factor three. Apparently, 
then, all these tests are measures of power if we interpret strength as force 
and speed as velocity. 

Sit-ups and two-min. sit-ups are commonly thought of as being pre- 
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dominantly measures of abdominal strength. Football coaches, wrestling 
coaches, and physical education instructors for some time have been using 
this event as the best method of increasing the abdominal strength of an 
individual. It is the belief of the investigator that sit-ups and variations of 
sit-ups are more indicative of the strength of the trunk flexors than of the 
abdominal muscles. The two sit-up events included in this study have the 
highest relationship to the thigh flexor factor. Significant correlations are 
also observed between these two events and factor one. This might be ex- 
plained by the fact that the hands clasped behind the head appear to aid 
considerably in the performance of this test. 

Pull-ups, push-ups, and dips are all generally considered to be measures of 
arm and shoulder girdle strength. Correlations of these three tests with 
factor one are high when compared to other test correlations with this factor. 
Significant correlations are also apparent with the factor three. This seems 
a reasonable expectation since the more rapidly these tests are performed, 
the better the performance in terms of number completed. 

The 16-lb. shot put shows high correlations with the factor two. This 
result is in agreement with the general conception of the abilities involved 
in putting the shot. 

The medicine ball put is generally considered a power item and shows a 
high relation (.6202) with the factor two. A significant correlation is also 
observed between the medicine ball put and the thigh flexor factor. The 
negative and low relationships with factors one and three are not meaningful, 
and no explanation is attempted. Although there are several instances where 
no explanation can be given relative to the correlations derived, the majority 
of the test and factor intercorrelations are capable of meaningful interpreta- 
tion. 

Regression equations in standard score form were developed for predicting 
each of the isolated factors. Additional predictive indices were developed 
for 1. combined factors, 2. test items having the highest communalities, 3. test 
items selected according to the principle of being relatively easily adminis- 
tered, and 4. test items selected on the basis of a high correlation with the 
criterion measure and a low intercorrelation between test items. All of the 
above equations for prediction are available from the author. 


Conclusions 


1. Of the 26 selected test items which are supposedly measures of strength, 
speed, agility, endurance, power, balance, flexibility, co-ordination, and card- 
io-respiratory endurance only four factors were identified. The low com- 
munalities of each of the test items indicates the presence of a specific factor 
or factors not isolated. 

2. The four factors isolated were identified in terms of the resultant cor- 
relations between the individual tests and the factors. Isolated factors were 


identified as: 
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I Arm Extensor Endurance 
II Power of Leg Extensors 
III Power of Hip Extensors 
IV Power of Thigh Flexors 


Factors are identified with considerable reservation because of the low 
factor loadings and communalities. 
3. Regression equations in standard score form are available for predic- 
tion in terms of isolated factors. 
4. A predictive index for combined factors was developed and is available. 
5. An index of test items having the highest communalities was developed. 
6. The recommended four-item battery selected on the basis of a high 
correlation with the criterion measure and a low intercorrelation between test 
items showed a multiple R of .9040. The equation is: 
X. = —6.7252 (75-yard dash) + 
.0797 (16-pound shot put) + 
.1871 (3 standing broad jumps) 
+ .5601 (dips) + 38.0657 
7. The results of this study are not in harmony with the measurements of 
agility as defined. Strength appeared to be the most significant factor in the 
successful performance of this trait. 
8. Dashes included in this analysis showed high relationships with the iso- 
lated factor of hip extensors. 
9. Recommended test batteries are not in agreement with the batteries 
developed and used during the period of World War II. 
10. There is a need for further study in the area of motor fitness in order 
to determine what other factors, if any, these test items are measuring. 
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Abstract 

Freshman high school boys were given three weeks of group practice in their physical 
education class, in running 80 and 160 yd. at a steady pace. A timing gong was sounded 
at the instant the runner was supposed to be crossing each of a series of 20-yd. marks, 
During the third week, individual time trials were made in a 220-yd. run without the 
gong. Two runs were of the steady-pace pattern and a third was made at “all-out” 
speed throughout to secure a variable-pace high-decrement pattern. Time trials were 
repeated during a fourth week. The steady-pace run average 0.6 sec. faster (¢ = 3.85) 
than the all-out run, and the boys reported less fatigue. 
IT HAS BEEN KNOWN for many years that the oxygen requirement of 
running increases very rapidly as the velocity becomes greater. Physiologists 
have recommended the even pace as more economical of energy. Neverthe- 
less, most coaches and athletes advocate and use a variable-pace design for 
running the longer sprints and middle distances. 


Recommended Patterns 

For the 220-yd. race, good runners should sprint the first 75 yd., float 75 
yd. (while the runner slows his speed slightly), and then drive the last 70 yd., 
according to Gauthier and Haney (4). This opinion corresponds to Crom- 
well’s: “If they will learn how to run their best 200 yds. in advance, knowing 
when to ease off from the opening burst of speed to the coast, and when to go 
from that to the final gather, they are most likely to win” (2, p. 58). 

Doherty feels that in the 220, the problem is to achieve maximum velocity 
and then maintain it as long as possible. “The speed should very gradually 
but consistently fall away after the 60 yard mark through to the finish.” He 
shows a typical all-out deceleration curve as an example (3, p. 74). In both 
the 440 and 220, he thinks that there should be a gradual and natural decel- 
eration to make sure that the runner crosses the tape completely exhausted. 
He does not approve of the sprint-float-drive pattern recommended by Crom- 
well and others who advocate that system (1, 2, 4). 

Murphy (9, p. 42) as early as 1914 thought that in running the 220-yd. 
dash the runner must learn to “swing through this race at almost one’s best 
speed for the first 180 yd. without actually trying every step of the way as in 
the 100-yd. dash.” In other words, he advocates a relatively even-pace type 
of run with less variation in speed than in the variable-pace patterns advo- 
cated by coaching textbooks. 


1 From the Research Laboratory of the Department of Physical Education. The present 
address of the writer is El] Camino High School, Sacramento, California. Appreciation is 
expressed to Dr. Franklin M. Henry for advice during the course of the investigation. 
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Coaches who advocate the even-pace race have the support of the physiolo- 
gists who have studied the problem of energy cost and speed of movement. 
This topic, particularly as it applies to running, has recently been reviewed by 
Henry (8). “It can be said with confidence that insofar as the physiological 
limit is involved in setting records, a steady pace will result in faster time for 
the 220 and 440 as well as the mile and two mile runs” (7). 

Calculation of Advantage. Time-velocity equations which describe the posi- 
tion and speed of the runner at any time in either the steady (even) pace or 
the all-out pace have been developed (8). By using known oxygen requirement 
data and these equations, it is possible to estimate the amount of oxygen 
required for a run of a certain speed and velocity profile. By equating the 
metabolic costs of the two types of race, the relative efficiency of one type of 
race over the other has been computed (8). 

No one seems to have investigated the comparative performance of run- 
ners in actually using the different types of velocity patterns under controlled 
conditions. 


Purpose of Study 

It is the purpose of this study to test the relative effectiveness of the steady- 
paced type of run compared with the variable-pace or high-decrement run, 
and to determine if it is feasible and practical to train high school students 
to run at a steady pace. 

Steady pace is defined as the maintaining, after acceleration, of a constant 
velocity throughout the entire run. However, to insure that the runner has 
put forth full effort, there should be a slight dropping off in velocity during 
the very last part. In the variable pace, the runner will reach his maximum 
velocity in the early part of the run, and from that point on his velocity will 
decrease progressively. 


Method 


Subjects. Thirty-nine boys, who were volunteers from the regular Fresh- 
man physical education classes of the El Camino High School at Sacramento, 
California, were used in the study. None of them was experienced in running. 

Apparatus. The set-up for the experiment was a modification of the one 
suggested by Henry (7). The starting signal for each run was given by a 
clap-board. The gong arrangement for the steady-pace training consisted of 
wooden tripod 5 ft. high with a steel bar, 1 in. in diameter and 18 in. long, 
hanging by a line from the center of the tripod. This hanging bar was struck 
vigorously in cadence with a metronome set according to the Henry tables 
(7). Tenth-second stop watches were used for time checks at 70, 130, 190, 
and 220 yd. for the time trials. The rates of these watches were all checked 
against the “standard” watch, and corrections made as required. The 220-yd. 
straightaway past the bleachers was used as a track, and the gong was set up 
in the bleachers approximately 80 yd. from the starting line. The check 
marks on the track for the training consisted of high hurdles placed on the 
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curb at 20-yd. intervals with the exception of the first one which was placed 
at 21 yd. 

Procedure. The experiment began April 19 and lasted for five weeks. The 
subjects were told that the study would consist of a comparison of two ways 
of running. They did not know and were not told which type might be ex- 
pected to give the best results. They were reminded several times during the 
course of the study of the importance of doing their best to run th. way they 
were asked and to put forth every effort at all times. They were instructed in 
the start, running form, and the necessity of a suitable warm-up every day 
before the run. The medium-stance start, which has been found to be the best 
for most runners (5, 7) was used. 

Training and Practice. First, the subjects ran 220 yards as fast as they 
could and the finish times were recorded. With these known times, the sub- 
jects were grouped to expedite their training for the steady-pace pattern. For 
each group, a metronome rate was chosen to give a total time just under their 
unit group’s previous total time. No training group had over six subjects 
running at one time, and several groups had only one or two in them. The 
subjects were instructed to pull off of the track if they were not within one 
stride of the 20-yd. check marks at the sound of the gong. 

During the course of the experiment the subjects were shifted from one 
group to another as their total times warranted, or if they thought they could 
run at a faster rate. After the first classification run, the subjects trained in 
their steady-pace groups for nine days (two calendar weeks). Each day they 
made two practice runs, the first a short one of 80 yd. and then a longer one 
of 160 yd. It had been noticed that the subjects had trouble keeping the first 
40 yd. slow enough for the proper steady-pace pattern, so the 80-yd. distance 
was used to give additional practice on the first part of the run. 

After the first nine practice sessions, individual time trials for the steady- 
pace run, without the gong, were made on two consecutive days with time 
checks at the 70-, 130-, 190,- and 220-yd. markers. The following day, each 
subject ran individually by the high decrement pattern (time checks were 
taken again at the designated spots). The experimenter and the timers gave 
verbal encouragement at each check point, as had also been done at the 190- 
and 220-yd. stations of the steady-pace runs. The subjects then practiced on 
the steady-pace run again for three days, and the 220-yd. individual time trial 
runs were repeated on two consecutive days. On the following day, the sub- 
jects each made a high decrement run of 220 yd. The next day (Friday) and 
on Monday and Tuesday of the following week, there was 80- and 160-yd. 
practice on the steady pace. On Wednesday and Thursday, two steady-pace 
220-yd. runs were timed, and on Friday the last run (by the high decrement 
pattern) was made. 


Results and Discussion 


The scores of the last three time trials were examined statistically because 
it was felt that the subjects’ performances were fairly stable by then, and that 
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there would no longer be much of a training or practice effect within a few 
consecutive runs. The very small difference between the two steady-pace runs 
seems to confirm this expectation; it was 0.03 seconds and not statistically 
significant (t = 0.15). However, the high decrement run was considerably 
slower, to the extent of 0.60 seconds. This difference is statistically significant 
(t = 3.85).? It is more clear-cut when the two steady-pace runs are averaged 
to secure greater reliability. The average of these, compared with the high- 
decrement run, gives a difference of 0.53 seconds (about the same as before) 
but the t ratio is larger (t = 4.31). 


In order to balance out the possible training effect that might be present, 
we can look at the 220-yd. time trials for the last two weeks of the study. 
There were two steady-pace runs in the fourth week and two in the last week. 
In each case, the pair of steady-pace runs was followed by one high-decrement 
run. This gives us the average of four even-pace runs to compare with the 
average of two all-out runs (Table 1). This comparison shows the mean 
steady-pace was run 0.48 sec. faster than the mean high-decrement run, with 
the t ratio = 3.51 which is highly significant statistically. 


TABLE 1 
Time Trials Using Two Patterns of rans 


‘Patter n 


___ Steady Steady High Decrement 

Final week sete 30.68 
Previous week i : 30.92 
Average 


30.65 31.26 sec 
31.44 31.54 sec 


} 
} 
30.92 | 31.40 sec 


| 
| 
| 
| 
| 
| 
| 





In Figure I it can be seen that the steady-pace run was not quite done in 
the theoretically perfect constant velocity pattern. However, the profile was 
definitely “flatter” than in the all-out or high-decrement type of run. 

We do not know exactly how irregularly these runs were made. Assuming 
that the range of average velocities is the difference between the highest and 
lowest means as measured at the first titning station and the last or next to 
last (whichever was slowest), the greatest variation from the average velocity 
of the run was 5.7 per cent for the steady-pace run and 14.0 per cent for the 

2 The standard error of the difference, 0.167 sec., includes sampling error and timing 
error. While it was not determined, the mean timing error was probably small. Cureton 
and Coe (Research Quarterly 4: 94-109, 1933) give data that show the variable error 
of a single timer for a single run averages about 0.13 sec. Using this figure, the mean 
timing error for the difference between two runs would be 0.02 sec., which is negligible 
compared with the errors that arise from sampling and intra-individual variation. For a 
discussion of this problem by Henry et al., see p. 465 of Scott (ed.), Research Methods 
Applied to Health, Physical Education and Recreation, Washington: Am. Assn. for Hlth., 
Phys. Educ. and Rec., 1949. 
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high decrement run. Assuming that the area of discrepancy between the per- 
fect steady pace and the run as made is triangular and the altitude at half its 
area approximates the mean discrepancy in velocity, these variations would 
be multiplied by 0.707 and become 4.02 per cent and 9.9 per cent. 


From Henry’s tables and curves (8), based on physiological efficiency, the 
steady-pace run would therefore have been done 0.64 per cent slower than a 
theoretically perfect even-pace, and the high decrement run would have been 
2.75 per cent slower. In this experiment, therefore, the advantage of the 
relatively steady-pace run, when compared with the high-decrement run, 
should be 2.11 per cent. The observed time difference did significantly favor 
the steady-pace type of run and the observed advantage was 1.96 per cent in 
the time trials of the final week. Under the circumstances, the agreement 
between the theoretical and observed advantage is very high. 


The motivation was probably better in the all-out high-decrement run than 
in the steady-pace run. In the latter, verbal encouragement could only be 
given near the end of the run, while in the all-out run the timers at each 
check point were able to urge the runner to go faster. The experimenter had 
the general impression that the subjects were more nearly exhausted in the 
high-decrement run. Verbal reports collected from the subjects, while not 
very well controlled or dependable, agree with this impression. 

More practice was given in the steady-pace pattern than in the other. The 
reason was that judgment of pace had to be developed in order to make an 
unguided even-pace run. In the high-decrement run, judgment of pace is not 
required, as the subject runs as fast as possible all the way down the track. 
Other than this, there were no special skills developed that could favor one 
race pattern over the other one. With more practice, the boys would prob- 
ably have come closer to the exact even pace. Older boys could be expected to 
learn the pacing more rapidly. An automatically starting, electrically ampli- 
fied metronome would be very useful. 

It is of interest that the steady-pace run in this experiment (which was 
intended to be closer to the “flat” profile than was actually accomplished) was 
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fairly close to the Doherty recommendation of “gradual deceleration” for the 
sprints and middle distance events. The important finding, however, is that 
the physiologically better pattern of the two profiles studied experimentally is 
the one that resulted in the best 220-yd. times. Murphy’s idea of the best way 
to run this event (9) is therefore given support, and Henry’s calculations 
have been confirmed in principle. The advantage would be greater in longer 
runs such as the quarter and half mile distances (8). It was not practical to 
train high school Freshmen in these events during the regular physical edu- 
cation period. 


Summary and Conclusions 


Thirty-nine male high school freshmen, inexperienced in track, were 
trained to run at a relatively steady pace for 220 yd. The purpose was to see 
if it was practical to teach them even-pace running, and to discover if this 
pattern of running was faster than the variable-pace pattern. 

The steady-pace practice consisted of passing 20-yd. markers each time a 
pacing gong was sounded. The training periods were made up of short prac- 
tice runs for a period of two weeks, followed by two weeks in which the sub- 
ject on different days ran two time trials at an even-pace and one according 
to the high-decrement or “all-out” pattern. In the time trials, stop watches 
were used at the 70-, 130-, 190-, and 220-yd. markers in order to check the 
velocity patterns. 

It was found that the 220-yd. speed in the steady-pace pattern was signifi- 
cantly faster than in the variable-pace pattern. The amount of advantage 
(2%) agreed well with the theoretical advantage calculated by Henry. 

While three weeks of practice was insufficient to teach the boys to slow 
down enough in the first half of the run to achieve a true steady pace, they 
did come close enough to the desired pattern to show its advantage over a 
more variable velocity pattern. 
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Abstract 

This article presents an analysis of the effect of eight physical education activities 
on physical fitness and motor ability scores of male college Freshmen. The mean gain 
was computed for each group on both tests and individual test items. The mean gain 
of each group was compared with each of the other groups and significance of differences 
determined, The results seemed to indicate that of the activities studied tumbling- 
gymnastics and conditioning best developed those abilities measured by the physical 
fitness test; while tumbling-gymnastics and wrestling best seemed to develop those 
abilities measured by the motor ability test. 


AMONG THE RECOGNIZED objectives of physical education are the objec- 
tives of developing physical fitness or motor fitness and motor ability. While 
considerable research has been done to establish tests to measure these abili- 
ties, relatively little experimental data are available concerning the contribu- 
tion of a given activity to a specific objective. The value of an investigation 
of this type lies in the light it may shed on which activities best develop cer- 
tain desired objectives within the participant. 

The objectives considered in this study are limited to physical or motor 
fitness and motor ability as measured by the tests selected. The relative value 
of developing skills in certain activities, e.g., ability to swim, over-increasing 
physical fitness or motor ability was not considered in this research. 


Statement of the Problem 

This study was designed to compare eight selected physical education ac- 
tivities in their development of “physical fitness” and “motor ability” in the 
students participating in these activities. The criteria for determining these 
two factors of physical fitness and motor ability were the Physical Fitness 
Test (1) and the Larson Test of Motor Ability (5). 


Related Studies 


In 1943, Wilbur (7), in comparing two types of physical education pro- 
grams, sports method versus formalized apparatus method, in terms of physi- 
cal fitness test results, found: 

1. The sports method evidenced significant superiority (total test) over 


1 This study was made in partial fulfillment of the requirements for the degree of 
Doctor of Education in the school of Physical Education, Pennsylvania State College, 1951. 
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apparatus method for the last one-half of the experiment. The sports 
method was also superior in developing arm and shoulder-girdle 
strength (chins and dips), and co-ordination, agility, and control (bar- 
snap). 

2. The two methods were equal in improving speed of legs (dodging run), 
jumping or leg strength (jump and reach), arm and shoulder-girdie 
co-ordination (baseball throw), and endurance (one-quarter mile run). 

3. The formalized apparatus method did not show significant superiority 

to the sports method on any of the tests used. 

In a study of the influence of selected physical education activities on en- 
durance test scores, Fordham (4) and Berrafato (2) in concurrent but sep- 
arate studies found that students taking the basic conditioning course made 
the greatest improvement on the endurance test. Mean gains for each of the 
activities were: basic conditioning 19.35; apparatus 12.84; boxing 11.29; 
weight lifting 11.06; intramural athletics 9.40; individual tumbling 8.05; 
wrestling 6.50; volleyball 5.09; and badminton 3.07. The mean gain is indi- 
cated in terms of standard scores. 


Activities Used 


The activities selected for this investigation were elementary courses in 
swimming, boxing, weight training, tennis, wrestling, volleyball, tumbling- 
gymnastics, and a basic conditioning course. The conditioning course con- 
sisted of calisthenics, running, pull-ups, and grass drills. The tumbling-gym- 
nastics course combined both mat tumbling and apparatus work. The classes 
met three times a week for one hour for instruction and participation in the 
designated activity. The primary objective for each course except the basic 
conditioning course was to develop playing or performance skill in that spe- 
cific area. All classes were taught by experienced full-time college instructors. 


Measures Used 


The test used as the criterion measure to determine motor ability was the 
Larson Test of Motor Ability (5) consisting of four test items: 1. baseball 
throw for distance; 2. pull-ups; 3. vertical jump; and 4. bar-snap. The test 
used as the criterion measure of physical fitness was the Phvsical Fitness Test, 
which was a slight modification of the Army Air Forces PFR test (1). The 
Physical Fitness Test consisted of three test items: 1. 300-yard shuttle run; 
2. pull-ups; and 3. sit-ups. 

The term “motor fitness” might be more appropriately used than the term 
“physical fitness” which usually denotes “fitness for a given task.” 


Procedure 

The study was conducted during the fall semester of the academic year 
1950-51, at the Agricultural and Mechanical College of Texas. The subjects 
for the experiment were first-semester Freshmen at this institution. All in- 
coming Freshmen (N = 1031) were tested by the Physical Fitness Test. Stu- 
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dents were matched by means of the physical fitness test and age in eight 
parallel sections. A third factor was used in the selection of the swimming 
group; namely, all students assigned to this group had failed a rather rigid 
swimming test. 

An analysis for significance of difference on the group selection factors— 
(1) physical fitness test, and (2) age—showed that there were no significant 
differences between any of the eight groups on these two measures. All groups 
were tested for motor ability by the Larson Test of Motor Ability. A C.R. 
analysis of initial scores on the Larson test indicated a significant difference 
at the 5-per-cent level of confidence between two groups—weight training and 
conditioning, the weight-training group having the better scores. No other 
significant differences were noted. Each group, except the swimming group, 
was then arbitrarily assigned to one of the selected activities in which they 
participated and received instruction for one semester. At the end of the 
semester all groups were again tested using the same two measures. To re- 
duce outside variables in physical education activities, all students were elimi- 
nated from the study who had participated in intramural activities more than 
three hours during the semester. Thus, the final sample was reduced to 432 
students. 


PHYSICAL FITNESS TEST 


At the beginning of the experiment no significant differences existed be- 
tween the eight groups on physical fitness scores. Seven of the eight experi- 
mental groups (Table 1) showed a significant improvement in performance 
on the physical fitness test.2_ The swimming group was the only group which 
did not improve significantly in abilities measured by this test. 

A comparison of the mean gain of each group with the mean gain of each 
of the other groups is shown in Table 2. The t-test for the significance of 
differences showed that the tumbling-gymnastics and conditioning groups 
were superior to each of the other activities in developing physical fitness. 
There was no significant difference in these two activities when compared 
with each other. The test items used to determine physical fitness were prac- 
ticed by the conditioning group as a part of the regular class period and it is 
quite possible that practice in the specific test items could invalidate the test 
for that group. The tumbling-gymnastics group was significantly superior to 
the tennis, volleyball, swimming, boxing, and weight-training groups. Condi- 


2For quick reference in determining significance of t, the following figures may be 
used: 
Even-numbered tables 
t = 2.63 is significant at the 1-per-cent level of confidence 
t = 2.36 is significant at the 2-per-cent level of confidence 
t = 1.98 is significant at the 5-per-cent level of confidence 
Odd-numbered tables 
t = 2.68 is significant at the 1-per-cent level of confidence 
t = 2.40 is significant at the 2-per-cent level of confidence 
t = 2.01 is significant at the 5-per-cent level of confidence 
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tioning showed a significant superiority over tennis, swimming and boxing, 
volleyball and wrestling, 

The data indicated that there was no significant difference in weight- 
training, wrestling, volleyball, and boxing in developing physical fitness in 
students who participated in those activities. All groups evidenced sufficiently 
superior to the swimming group so as to be considered significant at a level 
of confidence of at least 5 per cent. 


LARSON TEST OF MOTOR ABILITY 

As previously noted, the C.R. between the eight groups on the initial test 
showed two groups, conditioning and weight training, having a difference 
considered significant at the 5-per-cent level of confidence. 

In analyzing the mean gain between the initial test and post-test for each 
group (Table 1), it was found that four activity groups—wrestling, condi- 
tioning, volleyball, and tumbling-gymnastics—made gains that were consid- 
ered significant at the 1-per-cent level of confidence. The weight-training and 
tennis groups made gains that were significant at the 2-per-cent and 5-per- 
cent levels respectively. The swimming and boxing groups failed to make 
gains that were statistically significant. 

The t-test for significance of differences in mean gains (Table 2) showed 
that wrestling and tumbling-gymnastics were superior to each of the other 
six activities in developing motor ability as measured by the Larson Test to 
a degree that was considered significant at the one per cent level of confi- 
dence. There was no significant difference when wrestling and tumbling- 
gymnastics were compared, Conditioning and weight training both developed 
motor ability as measured by the test used to a significantly greater extent 
than did swimming and boxing; while volleyball was superior to boxing. 
Tennis and volleyball seemed to be approximately equal in the development 
of motor ability in students participating in those activities. The results of 
the study indicated that students participating in swimming and boxing de- 
velop less motor ability as measured by the Larson Test than do students who 
participated in the other six activities. 


TEST ITEMS 

Baseball Throw for Distance. Larson identifies one factor of motor ability 
as gross body co-ordination, which he describes as, “The capacity of the 
individual to integrate movements of different kinds (different requirements 
for each phase of the activity) into one pattern.” (6) The test Larson used to 
measure this factor of motor ability is the baseball throw for distance. 

At the initial test phase of the experiment, on analysis of critical ratios be- 
tween the means of the various activities only one significant difference was 
found. The difference, being between the weight-training and tunibling- 
gymnastics groups, was significant at the 5-per-cent level of confidence; 
weight training being the superior performance group. 
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Only one group, tumbling-gymnastics, made a gain that could be consid- 
ered significant at a level of confidence as high as 5 per cent (Table 3). The 
lack of greater improvement in this particular test item may be due, in part, 
to the fact that the initial test came at the close of the baseball and softball 
season when many of the subjects tested had been participating in those ac- 
tivities; while the post-test came four and one-half months later. This is only 
assumption on the part of the experimenter and is not a tested fact. It is to 
be noted, however, that all groups, with the exception of swimming which 
showed a mean loss, did show a slight mean gain although too small to be 
considered significant. 


Table 4 presents an analysis of mean gains for this test item. The improve- 
ment made by the tumbling-gymnastics group was significantly better than 
the conditioning, swimming, and tennis groups; while students participating 
in wrestling made significant gains over those students participating in swim- 
ming and conditioning. Since the critical ratio for significance of differences 
is a matter of degree; then the four activities—swimming, conditioning, ten- 
nis, and boxing—tend to develop to a lesser extent those gross body co- 
ordinations measured by the baseball throw for distance than do the other 
four activities studied. 


Pull-ups. Chinning, or pull-ups, was the only item that was duplicated in 
both tests. As a test item in physical fitness, it is used as a measure of mus- 
cular endurance; or, “The capacity of the individual to continue successive 
exertions under conditions where a load is placed on the muscle groups being 
tested.” (6) As a measure of motor ability, Larson used chinning as a test of 
a factor he described as dynamic strength which he defined as “. . . strength 
determined by the ability to lift the body weight and propel it upward. . .”(5). 

In determining mean gain and significance of mean gain for each activity, 
it was found that two of the groups—wrestling and tumbling-gymnastics— 
made gains that were considered significant at the 1-per-cent level of confi- 
dence (Table 3). The mean gain of the weight-training group was significant 
at the 5-per-cent level of confidence. Two activities failed to show any in- 
crease in performance—swimming, the mean of which did not change, and 
boxing which showed an actual decrease in performance, although not a sig- 
nificant one. 

The t-test for significance of differences in mean gains (Table 4) showed 
that the wrestling and tumbling-gymnastics groups made the most significant 
gains; while boxing and swimming seemed to develop this ability to a lesser 
degree than the other six activities studied. 


Vertical Jump. In classifying tests in terms of the predominant factors 
which they measure, Cozens (3) classifies the vertical jump as a measure of 
jumping or leg strength. Larson (5) identifies it as a measure of a factor he 
terms motor explosiveness; in that the test item requires the use of the body 
weight in a force-time relationship. 
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In performance on the initial test, the groups were homogeneous in that no 
C.R. significant at the 5-per-cent level of confidence was found between the 
- means of the various groups (Table 5). In studying mean gain and signifi- 
cance of mean gain for each activity, three groups—volleyball, tumbling- 
gymnastics, and boxing—made gains that were considered significant at the 
l-per-cent level of confidence. Whereas all other activities showed a slight 
mean gain, no gains were of such proportion as to be considered significant. 

A comparison of the mean gain of each group with the mean gain of each 
of the other groups (Table 6) indicated that the gain made by the tumbling- 
gymnastic group was significantly superior to the gain made by the other 
groups. The volleyball, conditioning, and tennis groups showed a statistically 
significant greater gain than did wrestling and weight-training groups. The 
data also indicated that the students who participated in weight training and 
wrestling least develop those abilities measured by this test item. 


Bar-snap. The bar-snap, as reported by Larson correlated highly with dy- 
namic strength; and Cozens describes it as a measure of body co-ordination, 
agility, and control. 

In an analysis of the significance of differences between the means of the 
various groups on the initial test, two significant differences were found. The 
tennis group, on the initial test, was significantly superior to the conditioning 
group at the 1-per-cent level of confidence. The tennis group was likewise sig- 
nificantly superior to the wrestling group at the 5-per-cent level of confidence. 

Each of the groups showed some measure of improvement between the 
means of the initial test and post-test (Table 5). The t-test for the significance 
of mean gain for each of the groups gave evidence that three groups—wres- 
tling, conditioning, and tumbling-gymnastics—made gains that were of such 
proportion as to be considered significant at the 1-per-cent level of confidence. 
The mean gain made by the tennis group was significant at the 2-per-cent 
level of confidence; while the mean gains reported for the swimming and 
boxing groups were significant at the 5-per-cent level of confidence. 

When the significance of mean gains was analylzed (Table 6), it was found 
that the gain made by the tumbling-gymnastics group was significantly su- 
perior to the gain made by each of the other groups except wrestling. The 
mean gain made by the wrestling group was significantly superior to the mean 
gains made by five groups; namely, tennis, volleyball, swimming, weight 
training, and boxing. 

300-yard shuttle-run. This test item is described as a measure of muscular 
endurance, which is defined as, 


The capacity of the individual for long-continued contractions (submaximum) where 
a sufficient number of muscle groups are used with a sufficient duration and intensity to 
put a demand on the functions of circulation and respiration. 


The means of the groups at the initial test phase ranged from m = 52.81 
sec., weight training; to m = 53.88 sec., wrestling. The critical ratio of the 
difference in means of these two groups was significant at the 2-per-cent level 
of confidence. 
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An analysis of the mean gains (Table 7) between the initial test and post- 
test for each activity indicated that two groups—tennis and swimming— 
showed an actual performance loss, although too small to be considered more 
than chance. The other six activity groups all showed a performance gain 
between initial test and post-test means. The wrestling, conditioning, volley- 
ball, and tumbling-gymnastics groups reported mean gains of sufficient size as 
to be considered significant at the 1-per-cent level of confidence. The mean 
gains of the other activities were not statistically significant. The 300-yard 
shuttle-run was practiced by those students taking the conditioning course. 


In a comparison of the significance of differences between mean gains of 
the various groups (Table 8), the t-test indicated that the mean gain made by 
the conditioning group was significantly superior to the mean gains made by 
the swimming, weight-training, tennis, and boxing groups. Four other 
groups — wrestling, volleyball, boxing, and tumbling-gymnastics — showed 
mean gains that were significantly superior to the mean gains made by the 
swimming and weight-training groups. No other differences in mean gains 
were indicated as being significant at the 5-per-cent level of confidence. 


Sit-ups. This test item was used as another measure of endurance, the pur- 
pose of which was to determine the capacity of the individual to perform suc- 
cessive exertions where a load is placed on certain muscle groups. In this 
exercise, the abdominal muscles are the predominant muscle groups tested. 

There were no statistically significant differences between the means of the 
various activity groups on the initial test. 


An analysis of the mean gain for each activity group between initial test 
and post-test performance (Table 7), five groups—wrestling, conditioning, 
tennis, volleyball, and tumbling-gymnastics—reported mean gains that were 
significant at the 1-per-cent level of confidence. The mean gain made by the 
boxing group was significant at the 5-per-cent level of confidence. No other 
significant gains were recorded. 


A comparison of mean gains (Table 8), between each of the activity groups, 
indicated that the mean gain made by the conditioning group was propor- 
tionately greater to a significant degree than the mean gain reported by each 
of the other groups, with the exception of the tumbling-gymnastics group. 
The mean gain made by the tumbling-gymnastics group was significantly 
greater than the mean gain made by the tennis, volleyball, swimming, weight- 
training, and boxing groups. The mean gain reported for the wrestling group 
was significantly superior to the mean gains reported by the swimming, 
weight-training, and boxing groups. All activity groups, except swimming 
and boxing, reported mean gains that were significantly greater than the mean 
gain of the weight-training group. Of the eight activity groups studied, weight 
training seemed to develop to a lesser degree those phases of endurance nec- 
essary to perform repeated sit-up exercises. 
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Summary 


The purpose of this study was to investigate the relative effect of several 
physical education activities on motor ability and physical fitness. The ac- 
tivities used in the study were: boxing, conditioning, swimming, tennis, tum- 
bling-gymnastics, volleyball, weight training, and wrestling. The tests used 
as the criterion measure to determine motor ability and physical fitness were 
the Larson Test of Motor Ability and the Physical Fitness Test, which was a 
slight modification of the Army Air Forces PFR test. 

The study was conducted during the fall emester of the academic year 
1950-51 at the Agricultural and Mechanical College of Texas, the subjects for 
the experiment being selected from 1,031 male first semester Freshmen at 
that institution. All incoming Freshmen were tested by the Physical Fitness 
Test; and students were selected for the experiment on the basis of homo- 
geneity of two factors, scores of the Physical Fitness Test and chronological 
age. The selected group was then divided into eight groups, keeping the 
groups as equivalent in physical fitness and chronological age as possible. 

A third factor was used in the selection of the swimming group, namely, all 
students assigned to this group had failed rather rigid swimming tests. All 
groups were then tested for motor ability by the Larson Test of Motor Ability. 

Each group was then assigned to one of the designated activities in which 
they participated for one semester. At the end of the semester all groups 
were again tested by the same two tests. To reduce outside variables, in 
physical education activities, all students were eliminated from the study who 
participated in intramural activities more than three hours during the 
semester. 

The data consisted of records of performance on the individual test items 
used and total scores for the two tests at the initial test and post-test stages. 
Statistical treatment of the data was made to calculate means and standard 
deviations for each test item and total test scores as recorded at both testing 
periods. By use of the critical ratio, the significance of difference in mean 
gains was studied to determine if there was a significant difference in the 
relative effect of the various activities on physical fitness and motor ability. 


Conclusions 

The results of these data indicated that improved physical fitness rating 
was equally well attained by those students participating in conditioning and 
by those students who participated in tumbling-gymnastics; whereas swim- 
ming, tennis, and boxing are the activities least apt to increase the student’s 
score on the Physical Fitness Test. It was further found from the study that 
the groups participating in wrestling and tumbling-gymnastics made the most 
significant gains on the Larson Test of Motor Ability. Again, the groups in- 
dicating the least improvement were tennis, swimming, and boxing. Tum- 
bling-gymnastics ranked equal to conditioning in the development of physical 
fitness and equal to wrestling in the development of motor ability; while 
wrestling and conditioning each ranked high on only one of the two tests. 





Influences of Physical Education Activities on Male Freshmen 307 


Further, the tumbling-gymnastics group was the only group that evidenced a 
significant gain between the initial test and post-test means for each of the 
individual test items of both measures used. The other seven groups failed 
to record a significant gain between the pre-test and post-test means in at least 
two or more of the test items. 

The results would seem to indicate that the course using the combined 
activities of tumbling and gymnastics best developed all phases of physical 
fitness and motor ability tested. The data also indicated that the activities 
which tend to least develop these abilities in students are tennis, swimming, 
and boxing. 

The investigation does not mean to imply that the development of specific 
sports skills in any of the recreational areas is secondary to the development 
of physical fitness or motor ability as measured by the tests used. It cannot 
be said, for example, that the ability to score high on the measures used in 
this study is more important than learning the skills necessary to participate 
successfully in activities that are satisfying to the individual. Each has its 
place in the physical education program. 


REFERENCES 

1. Army Air Forces Recuration No, 50-10. Training-Physical Fitness Test. Washing- 
ton, D. C., Headquarters, Army Air Forces, April 28, 1943. 

2. Berrarato, Peter R. The Effect of Various Physical Education Service Courses on 
the All-Round Muscular Endurance of University Students. Unpublished Master’s 
thesis, University of Illinois, 1949. 

3. Cozens, FrepertcK W. The Measurement of General Athletic Ability in College Men. 
Eugene, Oregon: University Press, 1929, p. 132. 

4. ForpHam, SHevpon L. The Effect of Four Selected Physical Education Activities on 
Muscular Endurance Test Scores. Unpublished Master’s thesis, University of Illinois, 
1949, 

5. Larson, Leonarp A., A Factor Analysis of Motor Ability Variables and Tests, with 
Tests for College Men. Research Quarterly, 12: 499-517, 1941. 

6. ————, Some Findings Resulting for Army Air Forces Physical Training Program. 
Research Quarterly, 17: 147, 1946. 

7. Witsur, E. A., A Comparative Study of Physical Fitness Indices as Measured by Two 
Programs of Physical Education: The Sports Method and the Apparatus Method. 
Research Quarterly, 14: 316-332, 1943. 


(Submitted 8/13/54) 





306 The Research Quarterly, Vol. 26, No. 3 


Summary 


The purpose of this study was to investigate the relative effect of several 
physical education activities on motor ability and physical fitness. The ac- 
tivities used in the study were: boxing, conditioning, swimming, tennis, tum- 
bling-gymnastics, volleyball, weight training, and wrestling. The tests used 
as the criterion measure to determine motor ability and physical fitness were 
the Larson Test of Motor Ability and the Physical Fitness Test, which was a 
slight modification of the Army Air Forces PFR test. 

The study was conducted during the fall semester of the academic year 
1950-51 at the Agricultural and Mechanical College of Texas, the subjects for 
the experiment being selected from 1,031 male first semester Freshmen at 
that institution. All incoming Freshmen were tested by the Physical Fitness 
Test; and students were selected for the experiment on the basis of homo- 
geneity of two factors, scores of the Physical Fitness Test and chronological 
age. The selected group was then divided into eight groups, keeping the 
groups as equivalent in physical fitness and chronological age as possible. 

A third factor was used in the selection of the swimming group, namely, all 
students assigned to this group had failed rather rigid swimming tests. All 
groups were then tested for motor ability by the Larson Test of Motor Ability. 

Each group was then assigned to one of the designated activities in which 
they participated for one semester. At the end of the semester all groups 
were again tested by the same two tests. To reduce outside variables, in 
physical education activities, all students were eliminated from the study who 
participated in intramural activities more than three hours during the 
semester. 

The data consisted of records of performance on the individual test items 
used and total scores for the two tests at the initial test and post-test stages. 
Statistical treatment of the data was made to calculate means and standard 
deviations for each test item and total test scores as recorded at both testing 
periods. By use of the critical ratio, the significance of difference in mean 
gains was studied to determine if there was a significant difference in the 
relative effect of the various activities on physical fitness and motor ability. 


Conclusions 

The results of these data indicated that improved physical fitness rating 
was equally well attained by those students participating in conditioning and 
by those students who participated in tumbling-gymnastics; whereas swim- 
ming, tennis, and boxing are the activities least apt to increase the student’s 
score on the Physical Fitness Test. It was further found from the study that 
the groups participating in wrestling and tumbling-gymnastics made the most 
significant gains on the Larson Test of Motor Ability. Again, the groups in- 
dicating the least improvement were tennis, swimming, and boxing. Tum- 
bling-gymnastics ranked equal to conditioning in the development of physical 
fitness and equal to wrestling in the development of motor ability; while 
wrestling and conditioning each ranked high on only one of the two tests. 
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Further, the tumbling-gymnastics group was the only group that evidenced a 
significant gain between the initial test and post-test means for each of the 
individual test items of both measures used. The other seven groups failed 
to record a significant gain between the pre-test and post-test means in at least 
two or more of the test items. 

The results would seem to indicate that the course using the combined 
activities of tumbling and gymnastics best developed all phases of physical 
fitness and motor ability tested. The data also indicated that the activities 
which tend to least develop these abilities in students are tennis, swimming, 
and boxing. 

The investigation does not mean to imply that the development of specific 
sports skills in any of the recreational areas is secondary to the development 
of physical fitness or motor ability as measured by the tests used. It cannot 
be said, for example, that the ability to score high on the measures used in 
this study is more important than learning the skills necessary to participate 
successfully in activities that are satisfying to the individual. Each has its 
place in the physical education program. 
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Abstract 


Recently, there has been considerable interest among coaches in the effects of vitamin 
Bis on athletic performance. This review article summarizes the research reported on the 
source and requirements of vitamin B,, in the diet, the role of this vitamin in red blood 
cell formation and metabolism, the effects of vitamin B.. on growth and work perform- 
ance, and the relative efficiency of oral and parenteral feeding of this nutrient. 


DESPITE THE FACT that evidence for securing improvement in work per- 
formance by dietary supplementation in normal, reasonably well-fed human 
beings is not encouraging (23, 42), interest in this approach does not seem 
to wane. One substance employed very recently for this purpose is Vitamin 
Bie. Two amateur athletic teams and one professional boxer are known to 
the author to use this nutrient regularly. Also, the vitamin has been recom- 
mended as a diet supplement for underweight athletes (40). Yet the scien- 
tific basis for this practice has not been established. Because it appears im- 
portant for physical educators and health educators to have at their disposal 
a summary of pertinent literature on Vitamin By2, this review is presented. 


With the observation of Minot and Murphy in 1926 (44) that pernicious 
anemia can be aided by liver feeding, the quest was begun for the specific 
factor or factors involved, This work finally resulted in the isolation of Vita- 
min By», a water-soluble vitamin, in England and the United States at about 
the same time (44). 


Source and Requirements 


Viamin By» is one of the most recently isolated dietary requirements and 
also one of the most potent therapeutic nutrients known. It is estimated to 
have 10,000 times the potency of refined liver extract in treating pernicious 
anemia (20). The chemical formula for this substance or its various forms 
(Vitamin By2., Biz», Bize) need not concern us here except possibly we might 
point out that the molecule contains carbon, hydrogen, nitrogen and oxygen 
plus a heavy metal, cobalt, and phosphorus. Because of the cobalt, biological 
isotope studies are practical. Natural B,2 is not found in vegetable tissue. It 
is synthesized by micro-organisms exclusively (43), probably in the rumen 
of animals (50). Beef liver and kidney are rich sources of the vitamin and 
some is contained in other meats, egg yolk, cheese, and casein (43, 48). 


The human requirement is probably only about one to two micrograms per 
day but healthy subjects apparently are not saturated with the vitamin since 
as much as 55 micrograms have been retained after a massive dose (4). De- 
ficiencies are believed to be rare among healthy men and women (24). This 
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would seem to be especially true among athletes since they are frequently 
heavy meat eaters, 

Despite the potency of this vitamin, it is relatively non-toxic. As much as 
1,000 micrograms by injection or 10,000 micrograms given orally in one 
dose have not resulted in toxicity (4). 


Vitamin B,. and Red Blood Cell Formation 


Hemoglobin contained in the red blood cell is important in athletics, par- 
ticularly because of its role in the transfer of oxygen from lungs to tissues 
and carbon dioxide from tissues to lungs. Vitamin By» is capable of raising 
the hemoglobin level of the blood but these effects have been demonstrated 
only in cases of pernicious anemia (6, 18, 20, 30, 48), macrocytic anemia 
(20), sprue’ (6), and among malnourished children (46). In normal sub- 
jects the vitamin had no effect on hemoglobin, red blood cell count, or 
eosinophil level (33). In experimentally-anemic B,.-deficient animals, the 
vitamin elicited a blood response (13, 17, 34, 35, 51) but in normal animals 
the hemoglobin was not affected (25). It is generally believed that vitamin 
Bie is necessary in the maturation of red blood cells from reticulocytes, the 
precursor of erythrocytes (2, 48, 50). One writer contends that Bio is used 
directly in red blood cell formation and not as a catalyst (36). 

In addition to those diseases in which the beneficial effects of By. have 
been established, Wetzel (54) claims to have been successful in treating one 
case of asthma with vitamin Bj», but this was not confirmed in a later study 
(21). Temporary relief of trigeminal neuralgia with B,. has been reported 
(49). Nerve regeneration and hence better co-ordination have also been re- 
ported as a result of B,2 supplementation in pernicious anemia patients (15). 


Vitamin B,. and Metabolism 


Amino acids derived from the breakdown of consumed protein are the 
building blocks of body tissue, One of these essential amino acids is methio- 
nine. It is now well established that vitamin B,2 has a decisive role in the 
bio-synthesis of methyl groups (3, 7, 9, 10, 12, 18, 22, 26, 37, 41, 47). Prob- 
ably the By. is needed as a co-enzyme in the manufacture labile methyl 
groups. This would explain the increase in B,2. requirement when a high 
protein diet is taken. In one study, Vitamin B,2 was reported to be necessary 
in the metabolism of glycine; another, amino acid (29). Absence of Vita- 
min By» resulted in early death among animals on a high glycine diet. An- 
other interesting application of B,2, related to its tissue-building function is 
that of decreasing wound-healing time (14). 

In addition to its established importance in protein metabolism, B,2 is be- 
lieved to be involved in carbohydrate and possibly fat metabolism (4). It 
apparently increases carbohydrate utilization, since rats which were deficient 


1A disease in which the blood picture is very similar to that which occurs in perni- 
cious anemia. 
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in this nutrient had smaller carbohydrate reserves and intravenous glucose 
was used poorly (28). In another animal study, high blood sugar in the 
glucose tolerance test was also noted (27). An interesting experiment reported 
by Edwards and Carter (11) utilized rats fed a protein-poor diet which re- 
sulted in a negative nitrogen balance and high fat deposits in the liver. Feed- 
ing 50 micrograms of methionine brought the nitrogen to equilibrium but 
the animals continued to have fatty livers, When By. was added to the methio- 
nine, the liver became normal and a 50 per cent increase in production of 
hemoglobin was noted. It is now known that methyl synthesis cures the fatty 
livers. 


Vitamin B,. and Growth 


Probably because of its role in protein metabolism, Vitamin B,2 has been 
demonstrated to be essential to normal growth among various animals (3, 19, 
41, 45). The usual experimental procedure in these studies has been to feed 
animals a diet deficient in By. and then add the vitamin to part of the 
group, keeping the others as controls on the B,»- deficient diet. An interesting 
observation in one study of chicks was the myoatrophy of the legs, which was 
very marked in Bj» deficiency (38). 

Studies on human beings have yielded conflicting results. In normal adults 
no significant effects on weight were noted with By. supplementation (1). 
Bye also had no effect on the growth of premature infants (8, 39, 44, 46). In 
a well-controlled study of children selected as having “simple growth failure,” 
oral feeding of Vitamin By» had a marked stimulatory effect on growth (53). 
Some studies have confirmed this (4, 54) but several did not (46, 52). How- 
ever, evidence in general points to improvement in growth with By» in chil- 
dren, provided those selected might be expected to have a deficiency in this 
nutrient, there is ad libitum access to food during treatment and there is ade- 
quate nutrient and caloric distribution in the diets offered (16). 


Vitamin B,. and Work Performance 


Wetzel (53) observed no effect on leg, back, or grip strength among the 
malnourished children who benefited in growth from By» feeding, In normal 
young men, Montoye ef al. (32) were able to show no effects of B,2 supple- 
mentation on grip strength, pulse rate recovery after moderate exercise or 
maximum performance on a bicycle ergometer. In a later study (31) with 
young institutionalized boys, no effects of B;» were observed on the step-test 
score or half-mile-run time. Both the control and experimental groups lost 
weight in this study, but, interestingly enough, the thinner boys (B-1 Chan- 
nel, Wetzel Grid) in the control group lost twice as much weight and ex- 
hibited four times as much channel shift as the thin boys on Vitamin By». 
The number of cases, however, was too small to consider this anything but an 
indication of a need for repeating the study. 

The lack of effect on performance is not surprising in the light of what is 
known about Vitamin By,» and the lack or rarity of deficiencies of this vita- 
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min among healthy subjects. There does not seem to be sufficient evidence 
to recommend Vitamin B,2 supplementation as a general practice among 
athletic teams. 


Oral Versus Parenteral Feeding 


There is little doubt that the injection method of supplying Vitamin By» is 
more efficient than oral feeding (4, 5, 6, 50, 55). The injection dose for per- 
nicious anemia patients is one gamma per day by injection, but orally it is 
50 to 75 gamma per day. However, injections are not practical when medi- 
cal supervision is not available, as is frequently the case in athletics, industry, 
and military operations. Furthermore, there is ample evidence that oral 
administration results in Bj activity in the urine (4, 55) and higher plasma 
values (55). Evidence that oral doses are effective among deficient animals 
and human beings is available in the many controlled studies quoted earlier 
in which growth was stimulated. Blood responses have been secured with 
oral administration in many independent studies with cases of anemia. The 
inclusion of the intrinsic factor reduced the required oral doses in anemia 
cases (2,6), but that probably can be explained by a deficiency of this factor 
in pernicious anemia. It is commonly believed that Vitamin By. is used di- 
rectly from food and the intrinsic factor of gastric juice aids in the absorption 
and not the synthesis of Vitamin Bj. (48). It is doubtful that the administra- 
tion of gastric juice would increase absorption in normal subjects. 
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Abstract 


The Kraus-Weber Test of muscular fitness was administered to a population of 1456 

elementary school children. Results indicate: 

1, The Test and Test items are highly reliable. 

2. There is no relationship between grip strength and success on the Test. 

3. Girls are substantially superior to boys at all levels in passing the Test, and in 
flexibility. 

4. For both sexes, there is a decided decrease in strength item failure as age in- 
creases, By the time the children are 11 years of age the percentage of failure on 
any item is less than 8 per cent, and agrees very closely with the results found for 
European children. 


IN THE MAY 1954 issue of the Research Quarterly and the December 1953 
issue of the Journal of Health, Physical Education, and Recreation, Dr. Hans 
Kraus and Mrs. Ruth P. Hirschland reported the results of their research on 
the muscular fitness of children. The information revealed in these reports 
should have aroused the concern of every person interested in the health and 
fitness of the young people in our Country. The apparent deficiencies of 
American children as compared to the children of other countries present an 
alarming picture. 


It was the purpose of this investigation to measure a city population of 
Indiana children in an attempt to determine how their muscular fitness test 
scores compared with the scores of the children measured by Dr. Kraus and 
Mrs. Hirschland. 

Other purposes of the investigation were to determine the reliability of the 
Kraus-Weber Test, to compare the Test results to results from a test of dyna- 
mometric strength, and to analyze performances on each of the Test items by 
age and by sex. 

The city chosen for the investigation can reasonably be considered a typical 
midwestern city for its population size of 26,000. It has four public elemen- 
tary schools and all of the physically normal population of these schools, 
grades one through six, who were present in school on the days the investi- 
gators visited, were given the muscular fitness tests. Rural schools within the 
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City School District, religious schools and laboratory schools were not in- 
cluded in the study. 


Procedures 

Seven faculty members of the Indiana University School of Health, Physi- 
cal Education, and Recreation composed the team which administered all 
tests. One member of the team had been trained and certified by Dr. Kraus 
in the administration of the Kraus-Weber Test. Ths team member was re- 
sponsible for supervising and instructing other members of the team in 
proper testing procedures. A preliminary training period was held for this 
purpose. 

The seven testors worked as a team througout the entire testing period. In 
some schools a special testing room was assigned with seven tables in the 
room and the subjects came there to be tested. In other schools, the team 
moved from room to room and used the double station desks available, for 
purposes of giving the tests. 

In addition to the original administration, repeats of the t test were given in 
order to determine test reliability. All repeats were given within five to fifteen 
minutes of the original test. Repeats were so distributed that schools, ages, 
and sexes were reasonably represented. The repeat of the test was always 
done independently by some other testor than the one administering the 
original test. A new scoring card with complete information was used for 
each repeated test, and these cards were matched up with the original ones 
during the subsequent analysis of the results. 

A manuometer! was used to determine the grip strength of a sample of 
children. Grip strengths were secured from the fourth, fifth, and sixth-grade 
children of one of the schools. The younger children were excluded, since the 
manuometer is rather difficult for them to handle. 


Test Description 


The Kraus-Weber Test is composed of six items, with failure on any item 
interpreted as a Test failure. The six items are described briefly as follows:? 

1. Flexibility—ability to bend over slowly and touch the finger tips to the 
floor without bending the knees. 2. Abdominals plus psoas—supine position, 
legs straight and hands behind the neck. Ability to sit up with support being 
given at the ankles. 3. Abdominals minus psoas—the same as abdominals plus 
psoas, only with the knees bent, feet on table. 4. Psoas—supine position, 








1The standard grip dynamometer was used, which provides no adjustment for variations 
in hand size. Purchasable from the Narragansett Gymnasium Equipment Company, Cen- 
tralia, Missouri. 

2For a complete description of the Test, see Kraus, Hans, M.D., and Hirschland, Ruth 
P. Minimum muscular fitness tests in school children. Research Quarterly, 25: 178-188, 
May 1954. 

The description of the muscle actions involved in the performance of the various test 
items represent the opinions of Kraus and Weber, and it was not the intent of this re- 
search to evaluate their analysis. 
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hands behind neck, legs straight. Ability to raise the legs with feet approxi- 
mately ten inches off the table and hold for ten seconds. 5. Upper back— 
prone position, hands behind the neck, supporting pillow under the abdomen. 
Ability to raise the trunk and hold for ten seconds, with operator holding feet 
down. 6. Lower back—prone position, knees straight, arms on table, sup- 
porting pillow under abdomen. Ability to raise the legs and hold for ten 
seconds, with operator holding upper back down. 

The last five items are considered by Kraus and Weber to be strength tests 
of the muscle groups concerned. One trial is given on each item of the Test, 
and the subject is passed or failed on this basis. 


Analysis and Discussion of Data 

The tests were administered to a total of 1,473 children, ages 6 through 
15. Since only 17 of the children were over 12 years of age, these 17 were 
eliminated in reporting the results. Thus, the report deals with 1,456 chil- 
dren, ages 6 through 12. 

RELIABILITY OF THE TESTS 

The tests were repeated with 215 children, and analysis of the results re- 
veals the tests to be highly reliable. Since the Test itself is scored as either 
passed or failed, as is each item of the Test, the reliability analysis necessi- 
tated the use of a technique adapted to a double dichotomy. 

On two of the items, upper back and lower back, there was complete agree- 
ment between the first and second adminstrations. On other items and the 
Test itself, the degree of agreement was so high that the tetrachoric coefficient 
and the Phi coefficient both estimated the relationship to be slightly over 
one. Obviously, neither of these techniques was suitable, since the degree of 
agreement, while very high, was not perfect. A third technique, which may 
be used in estimating relationship from a two by two table, is Yule’s Q (4, 
p. 210). This technique yielded the results given in Table 1. 


TABLE 1! 


Reliability Coefficients Determined by Yule’s Q for the Kraus-Weber Test and Four of 
the Items of the Test 








Kraus-Weber Abdominals Abdominals 
Test Flexibility plus psoas minus psoas Psoas 
954 954 986 974 990 











Since all of these coefficients are above .950, it can be seen that the tests 
are remarkably reliable, particularly in view of the fact that seven different 
people were involved in making the judgments. 
GRIP STRENGTH RESULTS 

There is substantial evidence in the literature (4, p. 89; 5, p. 503; 7, p. 65; 
8, p. 768) to the effect that grip strength shows a considerable relationship to 
the general muscular strength of the body, as measured by the dynamometer. 


Correlations between grip strength and general strength factors range as high 
as from .650 to .763. 
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The grip strength of the preferred hand of a sample of 126 children, 61 
boys and 65 girls, was measured by the manuometer. This sample included 
all of the girls and boys in the fourth, fifth, and sixth grades of one of the 
schools. 

The mean grip strength of the 30 boys who passed the Kraus-Weber Test 
was compared to the mean grip strength of the 31 boys who failed the Kraus- 
Weber Test. The mean grip strength of the “passes” was found to be 45.6 
pounds and of the “failures” 46.1 pounds. The difference between the means 
was evaluated by the t test, which indicated that the difference lacked sta- 
tistical significance (t = .23). The same comparison was made for the girls, 
in which the 35 “passes” had a mean grip strength of 39.2 pounds and the 30 
“failures” a mean grip strength of 42.7 pounds. The difference between these 
two means also lacked statistical significance (t = 1.30). 

The grip strength of all of the boys was compared to that of all of the girls. 
The mean grip strength for boys was found to be 45.9 pounds and that for 
girls 40.8 pounds. The difference between these means was found to be sig- 
nificant at better than the 1-per-cent level of confidence (t = 2.83). 

The mean ages for the four groups—pass and fail boys, and pass and fail 
girls—were very similar, not varying more than one-tenth of a year. 

These results would appear to indicate that the Kraus-Weber Test is not 
related to general muscular strength insofar as the grip strength test can be 
considered to be measuring general muscular strength. These results are not 
surprising, however, since the majority of failures in the sample were deficient 
on the flexibility phase of the Kraus-Weber Test. The small number of mus- 
cle weakness failures made a comparison of just those failures hardly worth- 
while. 

COMPARISON OF INDIANA RESULTS WITH KRAUS RESULTS 

Table 2 presents a comparison between the results obtained by Kraus and 
Hirschland in the population they measured and the population measured by 
the Indiana investigators. For purposes of convenience in drawing compari- 
sons, the populations will be referred to as the Kraus and Indiana Groups. 


TABLE 2 
Comparison of Failures in Total Populations, Kraus and Indiana Groups 








Per Cent Per Cent | Per Cent 
Incidence of Test Flexibility Per Cent Incidence 
Group Failures Failures Failures Weakness Failures 
Kraus —__ neal 80.0 57.9 44.3 35.7 
Indiana 53.9 45.1 36.3 17.7 











A glance at the figures reveals immediately that the Indiana group is some- 
what superior to the Kraus group in all comparisons. The most striking dif- 
ferences occur in incidence comparisons. 

In arriving at incidence figures for the Test, the number of flexibility fail- 
ures, abdominal failures, psoas failures and back failures are summed. The 
per cent that this sum is of the total Tests administered yields the incidence 
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percentages. A child is counted as an abdominal failure if he fails on either 
or both of the abdominal tests and a back failure if he fails on either or both 
of the back tests. Thus, a child could be counted four times if he were to fail 
on each of the four areas of the Test. Incidence of weakness failures is found 
in the same manner, except the flexibility item is eliminated from the count; 
thus, a child could be counted three times if he failed in each of the weakness 
areas. 

The figures in Table 2 reveal that the Kraus group contained larger per- 
centages of multiple failures on the Test than did the Indiana group, and that 
multiple failures in the strength test areas are apparently contributing more 
to the wide difference in the incidence figures. The difference in per cent of 
flexibility failures for the two groups is only 8.0 while the difference in per 
cent of incidence of weakness failures is 18.0. It is not possible to compare 
the two groups in terms of number of children with weakness failures, since 
those figures are not included in the Kraus report. 

The comparison of failures at different age levels is given in Table 3. 


























TABLE 3 
Comparison of Failures at Different Age Levels in Kraus* and Indiana Groups 
[ Age par Re ae 

Failure S Group | 6 |. 3 % :, ar Ea eS il * 12 og 
Per Cent | 
Incidence of |Kraus ...| 76 73 8 | 6 | 69 78 117 91 
Failure Indiana | 71.1 | 534 | 434 | 518 56.3 50.2 49.3 
Per Cent Test 
Failures Kraus 54 55 52 | 54 58 62 65 
Per Cent Flexi-|Indiana | 54.1 | 44.9 | 36.1 | 408 | 504 | 445 | 449 
bility Failures |Kraus | 33 | 4 | @ | 39 44 40 55 
Per Cent Inci-\Indiana _| 30.7 | 325 | 311 | 361 | 45.1 | 414 | 420 
dence Weak- |Kraus | 39 | 37 | 27 | 30 | 32 40 | 36 
ness Failures |Indiana —| 40.4 | 208 | 123 | 15.7 | 11.2 | 88 | 72 





*Percentages estimated from graphs in Kraus Report (3). 


Apparently an error has been made in graphing the figures for the age 11 
group in the Kraus report. The incidence of failure graph shows a tremen- 
dous peak at 11 years, estimated at about 117 per cent. Since the sum of the 
per cent of flexibility failures and the per cent incidence of weakness failures 
should add up to the total per cent for incidence of failure, it can be seen 
that an error exists. It is most reasonable to think that the 117 per cent is in 
error, since the reason for such a large increase in incidence of failure would 
be rather difficult to explain. However, since it is impossible to determine 
where the error lies, this age level will be discounted in the comparisons. How 
the error would affect total figures reported would depend on where the error 
was made. Other smaller inaccuracies noted can be attributed to errors in 
plotting, or reading exact figures from the graphs. 

Again, it can be noted that the Indiana group is somewhat superior to the 
Kraus group, in that percentages of failures tend to be lower at all age levels, 
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except at 6 years. However, in terms of per cent of test failures and per cent 
of flexibility failures the general trend of failures is very similar for both 
groups. It is in incidence of Test item failures and incidence of weakness 
failures that the discrepancies are greatest. The incidence of weakness failures 
for the Kraus group shows a decided downward trend from age 6 to age 8, 
but then for some reason takes an upward swing to age 12, while for the In- 
diana group there is a constant decrease with age, dropping from 40.4 at age 
6 to 7.2 at age 12. It is here that it would be important to have a comparison 
between the two groups in terms of the per cent of weakness failures. Be- 
yond the age of 8, the incidence of test failures is increasing more rapidly 
than the per cent of test failures, in the Kraus group. This would imply that 
there is an increase in multiple failures, but not necessarily an increase in 
per cent of weakness failures. 

Kraus and Hirschland state that “. . . children coming into the first grade 
of the school system are already seriously deficient. Furthermore, it appears 
that we are unable to alleviate this situation during the time the children are 
in elementary schools. They leave elementary school in very much the same 
condition as when they entered it—if anything a little worse.” (3, p. 185). 

It is the intent of this discussion to suggest that if the Kraus figures for per 
cent of weakness failures were available that a picture more like the one found 
for the Indiana group might emerge. In that case, it would mean that while 
there were more children with multiple weakness failures in the Kraus group, 
there would not be such discrepancies in per cent of weakness failures nor 
would it be valid to conclude that the children leave the elementary schools 
in much the same condition as when they entered. They would be increasing 
in flexibility failures, but decreasing in numbers of children with strength 
failures. 

Owing to the fact that the Indiana group had relatively few multiple 
failures, their incidence and per cent of weakness failures are very similar. 
ANALYSIS BY SEX AND AGE OF THE INDIANA GROUP 

The numbers of children, boys and girls, which were measured at each age 
level are given in Table 4. 























TABLE 4 
Numbers of Boys and Girls at Each Age Level in the Indiana Group 
ICRA AA A NARI "ee RE 
Sex 6 7 | 8 | 9 | re. Sal ee we i __ Total 
Boys | 108 145 124 88 | 114 121 35 735 
OI ccs: cstees a. aoe 138 120 103 | 110 106 34 721 
Total | 218 | 283 | 244 | 191 | 224 | 227 | 69 1456 











The data in Table 5 reveal the differences in per cents of Test failures that 
exist hetween the boys and girls of the Indiana group. 
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TABLE 5 
Number and Per Cent of Test Failures at Different Age Levels for Boys and Girls of the 
Indiana Group 
Age_ 
6 7 8 | 9 10 11 12 





Sex | n | % n! % n|l % nl % nl % nl! % ni! % 


Boys —| 64 59.3 | 75 51.7) 58 468] 49 55.7| 60 526 | 60 496) 17 486 
Girls | 54 49.1 | 52 37.7 | 30 25.0 | 29 282) 53 482) 41 38.7) 14 41.2 


























At every age level, the girls show a decided superiority over the boys. Since 
these tests are designed to measure minimum fitness or to determine the mus- 
cular fitness levels of children, we may only conclude that if these tests are 
valid, our girls are far more muscularly fit than our boys. The natural ques- 
tion is to ask why this should be so. In explaining the differences between 
the American and European children, Kraus and Hirschland state “The major 
difference between these two groups is the fact that European children do 
not have the ‘benefit’ of a highly mechanized society; they do not use cars, 
school buses, elevators, or other labor-saving devices. They must walk every- 
where—even to school, frequently a long distance. Their recreation is based 
on the active use of their bodies. In this Country the children are generally 
conveyed in private cars, or by bus, and they engage in recreation as specta- 
tors rather than as participants.” (3, p. 183) There may be some truth in 
these statements and it possibly accounts for some part of the differences 
found to exist between American and European children. But, this reason- 
ing can hardly be applied to the differences observed between boys and girls. 

It is felt that the complete breakdown of the figures for each test, by sex 
and by age, that appears in Table 6 will give at least a partial answer to this 
question of differences both for the countries and for the sexes. 

Reference to Table 6 reveals that at every age level the girls are superior to 
the boys on the flexibility item, while on all other test items the differences 
between the two sexes are minor and favor neither sex consistently. Hence, it 
is largely due to the differences in flexibility of the two sexes that the. girls 
would be termed as more muscularly fit on the basis of the Test results. It is 
not a new concept that bovs are somewhat less flexible than girls, and it does 
not seent reasonable to characterize the boys as more likely to be health 
failures or less muscularly fit than the girls, simply because fewer of them 
can touch their finger tips to the floor with their knees straight. 

The facts, that inflexibility increases with age for both sexes and that fail- 
ures on all of the other items decrease with age, result in a fairly constant 
per cent of Test failure at all age levels when the sexes and failures on indi- 
vidual items are combined. When these distinct sex and age differences are 
obliterated by summing them all together, it appears that the children are 
failing badly at all age levels and that no changes are occurring. For the 
Indiana group, this is far from the truth. It also might be observed at this 
point, that in view of these distinct sex and age differences, it hardly seems 
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reasonable to call any muscular fitness test a minimum standard regardless of 
age and sex. 
















































































TABLE 6 
Number and Per Cent of Failures for Each Test Item, by Age and by Sex, for the 
Indiana Group 
Age < 

6 7 = ae 19 els > TE Pe AN 
Test? Sex nl % n| % n| % nl % n| % nl % | nl % 
F Boys | 44 40.7| 61 42.1] 52 41.9) 45 51.1] 58 50.9| 57 47.5| 16 45.7 
Girls _..| 23 20.9} 31 22.5| 24 20.0| 24 23.3) 43 39.1) 37 34.9] 13 38.2 

A Boys ___| 1110.2; 5 34; 1 8) 4 45] 0 0 8 25/).:0:0 
+ Girls __| 1412.7; 10 72) 1 8] 4 38) 0 0 3), .- B41, 28 
A Boys —_| 38 35.2| 2819.3} 10 81] 11125) 6 53) 9 75) 1 2.9 
™~ iGirls _.| 36 32.7) 25 181] 11 92| 14136) 1311.8). 5 47) 2 59 
p Boys __..| 38 28) 4 231 °3. 24) 2°28! 1° 91] 3 25} 1° 29 
Girls | 4 36] 2 14] 3 25| 2 19| 5 45] 8 28) 1 29 

UB Boys _..| 2 19) 0 0 0 0 0 0 0 0 0 0 0 0 
Girls _..| 2 18) 0 0 : Bi °6°O 0 0 0 0 0 0 

i Boys _.| 00] 00] 1 8| 00/00] 00] 00 
LB icirs | 3 27, 00 | 1 8| 110|/ 00| 00] 00 
iF — flexibility; A+ — abdominals plus pecas; A- s shéeminels mines psoas; P— psoas; 


UB — upper back; LB — lower back. 


The greatest per cent of failure on the strength items is on the abdominals 
minus psoas and it appears to be very valuable in detecting weaknesses in the 
muscle groups concerned. It is heartening to note, however, the very rapid 
decreases occurring in both sexes as the age increases—the boys decreasing 
from 35.2 per cent failure at age 6 to 2.9 per cent at age 12, while the girls’ 
failure rate progresses from 32.7 per cent failure at age 6 to 5.9 per cent 
failure at age 12. 

The psoas test results show only small percentages of failure for both sexes 
at all ages. The average failure for the boys is 2.3 per cent and for the girls 
2.8 per cent. 

In the entire population of 1,456 children, there were only 5 failures on the 
upper back test, 6 failures on the lower back test, and none after nine years 
of age on either test. It would appear that, for the Indiana group at least, 
these items contribute little to the Test and could be eliminated without 
noticeably affecting the results. If back strength is to be measured, as part of 
a fitness test, other more discriminating items need to be substituted. 

The figures of Table 6 indicate that after the Indiana boys and girls reach 
11 years of age there is no strength item on which they show as much as 8 
per cent failure. At this age, these boys and girls are coming very clese to 
the European children in the strength items. The major discrepancy in Test 
failure must be attributed to differences in the flexibility item. The cause of 
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this difference in flexibility, as measured by ability to touch the finger tips to 
the floor with knees straight, and its significance to muscular fitness are 
problems needing further exploration. 


Summary and Conclusions 


The Kraus-Weber Test of muscular fitness was administered to a population 
of 1,456 elementary school children from an Indiana city. Repeats of the 
test were given to 215 children, and the grip strength of the preferred hand 
was measured for 126 of the children. 

The Test and the individual items of the Test were all found to be highly 
reliable as evidenced by coefficients exceeding .950 in all cases. 

No relationship appeared to exist between the Kraus-Weber Test and the 
Grip Strength Test. The differences between the mean grip strength of chil- 
dren passing the Kraus-Weber Test and those failing this test lacked statistical 
significance. This was true for both boys and girls, although a significant 
difference between the sexes was found, in favor of the boys. 

The Indiana group was found to be somewhat superior to the Kraus group 
in all failure comparisons. However, the per cent of test failures and per cent 
of flexibility failures at the different age levels showed very similar trends 
for the two groups. 

The major discrepancies between the two groups were in incidence of Test 
failure and incidence of weakness failure. 

The analysis of the Indiana results by age and sex revealed the following: 

The girls are substantially superior to the boys at all age levels on the 
Kraus-Weber Test. This superiority is due to their greater success on the 
flexibility item at all age levels. 

For both sexes, inflexibility increases with age, with more pronounced in- 
creases for the girls at the higher age levels. As the children become older, 
the flexibility measures of the sexes are coming closer together. 

For both sexes, there is a decided decrease in strength item failure as age 
increases. By the time the children are 11 years of age, the per cent of failure 
on any strength item is less than 8 per cent. Their strength measures approach 
very closely those of the European children at this age. 

When flexibility and strength weakness figures are added together, without 
regard for sex, differences which exist are obliterated, and the results imply 
that no changes are occurring with age, in the children. Hence, these figures 
should be separated in order to reveal the true failure picture. 

Since not more than six children in the entire population of 1,456 failed on 
each of the back strength items, these items are not discriminatory and 
should either be eliminated or some other back strength tests substituted. 

The discrepancy in the flexibility of the Indiana children and the European 
children, as well as the discrepancy between the boys and girls in flexibility 
are not readily explained. There is need for further exploration of the reason 
for these differences and the significance of flexibility failure to muscular 
fitness. 
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Measurement of Kinesthesis ’ 
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Abstract 

This is an attempt to establish tests for measurement of kinesthesis. For the first sample 
of 100 college women, 28 measures of kinesthesis and 2 of motor ability were adminis- 
tered. For the second sample of 70 college women, 15 of the previous tests were given 
and one new one was added. Analysis was made of the quality of the test items and of 
the inter-relationship of tests. Most of the tests were adequate in reliability. No single 
test was sufficiently highly related to the criteria employed to be used alone as a measure 
of kinesthesis, though several combinations gave fair validity. Because of this fact and 
because of the very low number of significant intercorrelations, it was concluded that 
kinesthesis is composed of a series of specific functions. 


IN SPITE.OF THE FACT that physical educators have recognized for years 
that the sense of kinesthesis must have relationship to motor performance, we 
really know very little about how to identify the varying degrees of this sens- 
ory acuity. Efforts at measurement have been sporadic and not very fruitful. 
An attempt at definition and identity, which is essential as a starting point in 
measurement, is impossible because the facts are lacking. It represents a 
challenging type of investigation, but in analogy is a situation of trying to 
lift one’s self by the boots without even the boot straps to grasp. 


This investigator is not the only one seeking an answer to these issues. For 
the period preceding 1950, there are studies such as Bass (1), Young (25), 
Fisher (4), Phillips (14), Seashore (20), Honzik (6). Since 1950, they have 
become more numerous; there have been studies such as Stevens (21), Wiebe 
(23), Witte (24), Roloff (18), Henry (7), Espenschade (3), Riley (17), 
Garrison (5), Worchel (22), Mumby (12), Norrie (13), Lindsay (9), Meday 
(10), Mullin (11), Rehling (16). Studies are now coming not from two or 
three centers but from many different ones scattered from coast to coast. 


The earlier studies aimed at validating empirically selected test items, and 
they have opened the way for the later studies of factor analysis, correlation 
with learning rate, effects of practice, relationship to specific skills, and mea- 
surement of growth or maturation through childhood and youth. Why do 
more and more people devote time and effort to work on what seems to be an 
intangible? Because it is believed to hold at least a partial answer to the 
secret of individual differences in motor accomplishment and in learning rate. 
It can give the physical educator a tool with which to instruct and serve as an 
aid to understanding, guiding and motivating individuals. Method might be 


1Based on papers presented at the International Congress on Essentials of Physical 
Education for Youth, Connecticut Valley Colleges, April 13, 1954; and at the Research 
Section of the Central Association for Health, Physical Education, and Recreation Con- 
vention, Lincoln, April 9, 1954. 
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so improved as to lead to considerably greater accomplishments, and devel- 
opment of an ability in the learner to be more independent and more capable 
of self-directing his learning in the future. This at least is the faith on which 
this investigator continues, and is probably similar to that of others. 

Probably the most forceful statement of the significance of kinesthesis in 
motor performance comes from Ragsdale (15) in a discussion of motor 
learning. He makes statements such as “kinesthetic perception is fundamen- 
tal but must be interpreted and acted upon,” “blindfold practice makes for 
greater dependence on kinesthetic cues,” “manual guidance may help to de- 
velop kinesthetic perception of the activity.” He indicates many practical 
ways in which kinesthesis may be utilized to facilitate learning. It is faith in 
such a use that is bringing more and more efforts at relating kinesthesis to 
learning, measurement of the trait, and increased understanding of the ways 
in which it functions in motor behavior. 


Background of the Study 


The aim of these investigations is to establish a practical and valid method 
of determining kinesthetic function with a view to understanding better the 
learning process. The successive stages of work have required the following 
sub-problems, 

A. Identification of ways in which kinesthesis is demonstrated in motor 

performances 

B. Selection of tasks, based on these expressions of kinesthesis, suited to 

objective measurement 

C. Determination of the inter-relationship of measures and the contribu- 

tions of each measure to the concept of total kinesthesis 

D. Ascertaining the relationship of levels of kinesthesic acuity to learning 

facility and motor accomplishment 

Because of the nature of the problem and the lack of clear-cut evidence, 
the work has been characterized by replication, new hypotheses, gradual 
evolvement of new and revised forms of measurement. The data of this 
report are limited mainly to one phase of the work, with citation of a few 
points from earlier, unpublished studies. 

Kinesthesis has been recognized by both psychologists and physiologists as 
a sensory property of the human being with implications for motor accuracy 
and precision, individual differences, empathic appreciation, and even de- 
termination of neural normalcy. Kinesthesis is referred to frequently as the 
“motor sense” and for this investigator is defined as the sense which enables 
us to determine the position of segments of the body, their rate, extent, and 
direction of movement, and the position of the entire body, and the character- 
istics of total body motion. Stimuli pertinent to this sensation are carried by 
special receptors and nerves just as the stimuli of vision are carried by neural 
pathways peculiar to the eye, or the stimuli for smelling by neurons specific 
to the sensation of smell. 
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The receptors of kinesthesis are identified by the physiologist (2) as a) the 
muscle spindles, around the muscle fibers; b) the Golgi corpuscles in the 
tendons; c) the Pacinian corpuscles in tendons and articular cartilage of 
joints; and d) free nerve endings in the muscles, tendons and joints. They 
are stimulated by tension or pressure. The labyrinthine source of stimula- 
tion is activated by movements of the head when the head moves inde- 
pendently or in unison with the total. The vertical canals apparently function 
in antero-posterior or lateral movements while the horizontals are most helpful 
in the twisting, spinning movements. 

The eyes aid indirectly in contributing what might be called “orienting 
information” such as location of the horizontal, distances, changes in relative 
position of the organism, and movement of persons and objects about the 
observer. 

On this background of psychological-physiological perspective the various 
proposed tests have been developed with the usual assumption of face validity. 
The number of tasks that could be selected are almost limitless. Some do 
not lend themselves to objective measurement, some are not very interesting 
to subjects, some require elaborate equipment. If this is to be practical for 
the teacher, possible to use in the gymnasium as well as in the research 
laboratory, then special apparatus must be eliminated as completely as 
possible. 


Procedure 


Part A. Twenty-eight tasks were selected as meeting the standards of 
testing and as having face validity. Face validity, as used here, means that 
by hypothetical concepts based on present information concerning kines- 
thesis the tasks would appear to require some kinesthetic functioning. The 
tests were selected with the idea of sampling different phases of kinesthetic 
perception. This was based on a general study of the literature, previous 
experience with testing of this type, and results of a factor analysis study 
recently completed (12). These component phases are ability to repeat muscle 
contractions with a force identical to that which one has just exerted; 
ability to put the arms, legs and trunk in positions prescribed by visual or 
oral cues; balance and weight control; manipulative precision with the hand; 
orientation in space; and the ability to imitate promptly a simple co-ordina- 
tion which has been demonstrated, i.e., to translate empathic understanding 
into concept and thence to movement. To this series, broad jump (maximum) 
and motor ability were added as measures of accomplishment. 


TEST DESCRIPTIONS” 
1. Leg force—Required to extend leg with exactly 20 lb. of force. Score = deviation 
from 20 lb. 
2. Arm force—Required to extend arm with exactly 10 lb. of force. Score = deviation 
from 10 Ib. 


2Detailed test instructions may be obtained from the author. The abbreviated etate- 
ment is given here to give an idea of the task involved in the test item. 
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3. Pull—Required to pull with both arms exactly 10 lb. Score = deviation from 10 |b. 
4. Push—Required to push with both arms exactly 10 lb. Score = deviation from 
10 Ib. 

5 Balance leap-—Sideways leap, forward bend, moving object on floor and subsequently 
holding balance at least 5 sec. Score = time balance is held. 

. 6. Balance stick—Balance on one foot placed lengthwise on 1 in. square block, eyes 
closed. Score = total time. 

7. Weight shifting—Weight on one foot on solid block, shift weight toward other foot 
on scale until 4% weight is on each foot. Score = deviation from prescribed amount. 

. 8. Body sway—Center of gravity apparatus, standing on one foot sideways to scale, eyes 
closed, for 10 sec. Score = range registered on scale. 

9. Leg raising—Side lying abduct leg according to model shown (20°). Score = devia- 
tion from 20°. 

10. Thigh raising—Standing, raise leg, knee bent, till thigh is horizontal. Score = devia- 
tion from horizontal. 

ll. Arm raising—Standing, abduct arms to horizontal. Score = deviation from 90°. 

12. Wrist extension—Forearm supported on table, hand relaxed over edge. Extend wrist 
according to model shown (30°). Score = deviation from 30°. 

13. Finger spread—Forearm supported on table, fist closed, raise hand but not elbow 
and touch thumb and little finger on targets. Score = deviation from target center. 

14. Lower leg flexion—Standing on one foot, knee pointing downward, flex lower leg of 
free foot until lower leg is horizontal (90°). Score = deviation from 90°, 

15. Wrist flexion—Forearm supported on table, hand relaxed over edge. Extend wrist 
according to model (20°) (flexion). Score = deviation from 20°. 

16. Ball balance—Golf ball in small trough. Keep rolling constantly, crossing finish line 
but not contacting end of trough. Score = errors made in 30 sec. 

17. Target pointing—Standing sideways to target, pointer in hand, eyes closed. Raise 
pointer to target. Score = deviation from center of target. 

18. Horizontal lines—With eyes closed, draw 4 horizontal lines (parallel to one observed 
before closing eyes): (1) left to right with hand, (2) right to left with hand, (3) 
left to right with foot, (4) right to left with foot. Score = deviation from horizontal. 

19. Walk and turn—Facing down a straight line. With eyes closed, step right, and turn 
right, step left and turn left, repeat. Score = angle of deviation from the straight 
line. 

20. Broad jump specified—Stand toeing a line. With eyes closed jump so feet are just 
across a line 24 in. ahead. Score = deviation of heels from a 24-in. line. 

21. Floor target—Look at targets on floor and with eyes closed touch index finger to 
target point. Score = deviation of heels from a 24 in. line. 

22. Broad jump maximum—From take-off board, jump forward as far as possible. 
Score = distance from take-off to heel. 

23. Mator ability—Weighted sum of scores on broad jump (maximum), obstacle race 
and basketball throw for distance (19). 

24. Arm swinging—Four-count arm exercise, swinging forward and sideways, with alter- 
nate arm holding first count. Score = rating on precision and number of trials re- 
quired, 

25. Arm circling—Arm circling in opposite directions. Score = rating on precision and 
number of trials required. 


. The subjects were 100 college women selected at random. Their ages 
varied from 17 to 35 years, but were predominately in the 18 to 20 year age 
range. 


Reliability of tests with several trials was computed on the alternate halves 
and stepped-up with the Spearman-Brown formula. The tests with only one 
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trial were administered to part or all the subjects later in the testing hour; 
on other tests the subjects were asked to come back later for repetition of the 
tests. Thus, the number on which reliability was computed varied from 45 to 
100. The reliability was low on several tests. The five lowest were dropped 
from further consideration in this analysis. Intercorrelation for all the re- 
maining items were determined. A 5-per-cent level of confidence was adopted, 
for the 100 cases requiring an r of .195. 


The validity was computed by correlating with a battery recommended by 
Young (25) and with another recommended by Roloff (18). Formulas for 
these two batteries are: 

Sum of T-scores on Balance Stick + Leg Raising + Arm Raising (Young) 
.75 Balance Stick + 4.7 Arm Circling — Arm Raising — Weight Shifting + 50 (Roloff) 
Finally, multiple correlations were computed to estimate the validity ~f 7if- 
ferent combinations of tests. 


Part B. Subsequent to the analyzing of data obtained on Group A as out- 
lined above, data were collected on another comparable sample of 70 college 
women. Some of the tests were retained in the same form, trials were in- 
creased on some in an attempt to improve reliability; and one new test was 
added—triangle walk. This was a test recommended by Worchel (22). He 
says that for movement through short distances, kinesthetic cues are respon- 
sible for keeping the subjects oriented and that those with defective semi- 
circular canal function did more poorly on this type of test than those with 
a normal labyrinth. The total number of tests given was 18. As in the pre- 
vious studies, the subjects were tested individually, and were either blind- 
folded or had eyes closed. 

Computation was made of reliability of each test. Two were not considered 
further because of low reliability coefficients. The remaining 16 were used 
for intercorrelation and validity computations. As in the previous study, 
the Young (25) and Roloff (18) criteria were used. A third criterion was 
also used, one obtained from data on Group A of this report. 


Analysis of Data 


Part A. Table 1 contains the reliability of the tests for both groups. These 
compare favorably in those items which have been used previously. For the 
majority of tests used with both A and B groups, the coefficients obtained on 
the two samples are similar. 

Intercorrelations of all items showed very few significantly greater than 
zero, that is, unaccounted for by chance. These coefficients are more mean- 
ingful in smaller groups of items which had been put into the experimental 
battery on the hypothesis of similarity of kinesthetic function. Therefore, 
the intercorrelations are presented in relation to the hypotheses on which the 
trial battery was built. Analysis of these hyoptheses follows on data from 
group A. 
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TABLE 1 
Reliability of Kinesthesis Measures 








Group Previous Studies 
Roloff | Scott(b) 
| 200(a) | 104(a) 
r r 











SMe ae aN ce ities alee Sen keen 


PBN TOE fivceccotessersicns 


| SP ie ae nena Bee 


Push oo kw 
Balance leap -.. 


Balance stick 
Weight shifting _. 
Body sway (side) 


Leg raising — BMI: if z 
PRE CUI chiiseecresetineinnitinnccecneernaebioeriiat 


Arm raising ——.... 


UM NRE speciosa 
OS cc 
Lower leg flexion >>» 
I I ices So 


Ball balance 
Target pointing (pointer) 
Horizontal lines 





Walk and turn (angle) + 
Broad jump (specified) 


BE hin csintnlineetsinneii 
AN Ae ioe AON 


Arm circling ~~. 
Vertical lines —_. 


Body sway RS a ORR LMI 


Target pointing (finger) 
Trunk horizontal 





Walk and turn (distance) 


IE WEI ioc eecctteceecdce ice i. 











a Number of cases on which coefficients were computed. 


b Unpublished study. 


¢ Computed on alternate halves and stepped up with Spearman-Brown formula. 
4 Difference between r of A and B group is significant at the 5% level. 


A. Muscular contraction of a known amount is a function of kinesthesis 
and each subject should be equally accurate with various muscle groups. 


The items which would be expected to be related on this basis are listed in 
the first part of Table 2. Other tests were related significantly to one or 
more of those listed in Table 2 and these coefficients are also shown. 
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TABLE 2 


Correlations Between Measures of Specified Amounts of Contraction and Others Which 
Show Significant Correlation with Them 





Measure 1 2 8 4 





I ad 
iS ~~“ pea 
vee ot A 
. Broad jump (spec) 








Arm swinging . 

Ball balance _ 

Balance leap - 

Balance stick - 

Horizontal lines 

Floor target 

Finger spread | .22 
Weight shifting . = 


Broad jump (max) . ;-— 





Interpretation: 

1. The use of the arms in adapting effort appears to be inter-related in 
these tasks. 

2. The use of the arms in adapting effort is apparently unrelated to 
similar function in the legs. 

3. Facility in adapting effort is general enough to affect performance in 
a variety of tasks, apparently facilitating control of the hands and arms. 

4, The inverse relationship of balance stick and floor target with the 
push is not readily apparent. 


B. Balance ability is a specific function of kinesthesis. 

Four tests were judged to measure some aspect of balance. Correlations 
of other tests with the balance items are so uniformly significant that they 
are presented also in Table 3. (See facing page.) 


Interpretation : 


1. Three of the four tasks selected have some inter-relationship, though 
less than might be anticipated. Balance stick and body sway are in a sta- 
tionary position while balance leap involves movement. This may partially 
explain the rather low r’s. 

2. The other items with which they are related represent in general com- 
plex skills, with relatively little balance, apparently seeming to be general 
co-ordination instead. 

3. The relationship with the arm movement is not very meaningful. 
It may be that the arm action in balance is a very significant one, not just 
a mechanical asset as sometimes is assumed in analysis of skills. 
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TABLE 3 


Correlations Between Measures of Balance and Others Which Show Significant 
Correlations with Them 





Measure 





. Balance leap —..____ 
. Balance stick ——— ~~. 
. Weight shifting 
Oy CG cena Nimwsiicaiiece 





Broad jump (spec) —_. 
a NOD ates 
Broad jump (max) 
Motor ability —.__ 
Arm swims 
Arm circling ~~. 
co | anne 











C. Ability to assume and identify body position is a general capacity operat- 
ing throughout the body. 


TABLE 4 
Correlations Between Measures of Ability To Assume and Identify Body Position 








! 
Measure 6 
| 


. Leg raising 

. Thigh raising ——_ 
. Arm raising —.———.- ‘ ll | | 
. Wrist extension — i . j 
. Wrist flexion ¥ . 02 | —06 |. | 


. Lower leg flexion —,02 
. Finger spread —16 | —.02 











The only other items related to this group are arm circling and arm swing- 
ing, related to leg raising (.32) and arm raising (.19), respectively. 

Interpretation: 

1. The hypothesis is completely untenable. Either the ability to identify 
position of the various segments is specific for each segment or some factor, 
such as variations in experience, leaves these subjects using their potentiali- 
ties in very different ways in the different segments. 

2. While all the coefficients except one are insignificant, the majority 
are negative. This is in disagreement with previous studies such as Young 
(25) and Roloff (18). It is probably a chance occurrence. 
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D. Precision in use of the hands is @ basic function in manipulative skills 
and is made possible by perception of hand and arm position and 
movement. 

The four items selected on this hypothesis failed to show any relationship. 

However, there were a number of other tests which were related to them. 


(See Table 5.) 


TABLE 5 


Correlations Between Measures of Manipulative Hand Function and Others Which Show 
Significant Correlation with Them 









































Measure 1 2 8 ‘4 
1. Ball bounce —.___ ios 
2. Target pointing (pointer) _ 08 
3. Horizontal lines _.__ “ 17 00 
4. Finger spread — —.10 08 ll 
SII Bitconcinconnncnssnsibasiincines 22 
Arm swinging ———___ . 19 
Am crcliog ......__.. .29 23 
Push - 24 
Broad jump (maximum) ___ —.19 
Broad jump (specified) 22 
Interpretation : 


1. These particular tasks are unrelated and the hypothesis of common 
function in these tasks is untenable. 

2. Ball balance and horizontal lines are related to arm functioning of 
various types and are understandable as such. Ball balance apparently is 
more an arm function than one of the hands, 

3. The relationship between finger spread and broad jump events is un- 
accountable. 

4. The lack of relationship of these items and wrist flexion and exten- 
sion lends further support to the probability of specificity of functions. 

E. Orientation of the body in space facilitates precise movement and ability 
in various types of tasks involving space orientation should be inter- 
related. 

Table 6 shows the four tests to be unrelated among themselves, though 

four other items are related to them. (See facing page.) 

Interpretation: ; 

1. The hypothesis is untenable; these tasks are apparently specific. 

2. The weight shifting and finger spread might be considered a form of 
space orientation; similarity between specified broad jump and weight shift- 
ing is logical, that with finger spread is not and seems to contradict the con- 
cept of specificity of functions. Worchel (22) tested ability of subjects both 
with normal and insensitive labyrinths, He says that for long distances 
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TABLE 6 


Correlations Between Measures of Orientation in Space and Others Which Show Signifi- 
cant Correlations with Them 














| 
Measure tg 1 | 2 | 8 
: ——|- abe 
L, Werk O06 (00Rik. ee : 
2. Broad jump ns —.00 
3. Floor target : —.08 05 
4. Target pointing —.______| 04 —.13 —.08 
Leg force ____. ana —.25 
Weight shifting _—.._____| .20 
Finger spread aR .22 
PONE eictiitecsccnsspabicieiec ans —.26 





spatial imagery enters into orientation of the subject in moving in a specified 
angular path, but that on short distances, kinesthetic cues are mainly re- 
sponsible, He further stated that those with functional semi-circular canals 
did better on the tests than those without this function. All the tasks in this 
study would appear to be concerned with short distances. They do involve 
very different types of movement, however. 

F. Learning of a new skill is facilitated by kinesthetic cues which make for 

similarity of achievement in tasks to be learned. 

The total group of items shown in Table 7 represents what would appear to 

be very diversified capacities. 


TABLE 7 


Correlations Between Measures of Learning of New Skills and Others Which Show 
Significant Correlations with Them 























Measure ne 1 | 2 8 4 
1. Arm swinging - a | 
2. Arm circling — e A2 | 
3. Broad jump (maximum) - sta .26 24 
4. Motor ability — ~~ —_______. 30 31 84 
Tie’ Sed ti -| 24 | 
Balance leap —~.---.....| .20 .29 38 A3 
Balance stick —..._-___| .29 23 50 | 50 
Weight shifting aeplasiasiabaeks 21 19 | 
F:. | ees 19 
Ba tiene 19 .29 
Cee ee 32 | 
Horizontal lines — .23 
Finger spread — | | —.19 
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Interpretation: 

l. The hypothesis seems extremely plausible. It is the only assortment 
of hypothetically related items in which all are significantly related. 

2. Learning rate is related to accomplishment level. 

3. Whatever the source of varitation in learning rate is, it operates in 
many other tasks established in this study. The spread of relationship to 
other tasks, and the uniformity of positive relationship (only one negative) 
lend further support to the belief that previous experience is a contributing 
factor in learning and adapting to new tasks. Since these items were all 
selected on face validity, i.e., involving kinesthetic perception, one might 
conclude that kinesthetic perception is the connecting link between previous 
experience and learning of a new motor task. 

In considering intercorrelations of all items (see Table 10, p. 338), it is 
apparent that the relationships are in general low. They are lower than ob- 
tained in previous studies, and the question is raised as to the performance 
of the group and the test reliability. Table 1 showed the reliability to be not 
markedly different where comparison was possible on identical tests. The 
range and variance were found to be similar and in most cases large. Sub- 
jectively, it is believed that the subjects were not as uniformly interested 
and motivated as they might have been and as previous groups may have 
been. 

The face validity of the individual items was considered reasonable. No 
exact and completely verified criterion exists for statistical computation of 
validity. However, two of those which have been used previously were used 
in this study. The one was established by Young (25) and the second by 
Roloff (18). These results appear in Table 8. 

It is apparent first of all that very few of the tests are significantly greater 
than zero (.19 at the 5% level). For instance, on the Young battery only 
balance leap, arm swinging, and arm circling are significant, with horizontal 
lines, body sway, and leg force approaching significance. For the Roloff bat- 
tery the only significant r’s are on balance leap, arm swinging, horizontal 
lines, and walk and turn, with ball balance approaching significance. 

The multiple R’s and best eight combinations are presented in Table 9. 
The size of the coefficients for the first six are practically identical. There 
is little basis in the size of the R’s for choice between them. From the stand- 
point of administrative speed, the arm swinging and balance leap would 
appear optimum. The third, fourth, or fifth test does not add enough to pay 
for time consumed according to present evidence. 

Part B. Reliability of each item is given in Table 1. In comparing these 
coefficients with the ones on the earlier study, it will be noted that four of 
the coefficients dropped to the extent of a statistically significant difference. 
These four are arm force, push, leg raising, and arm raising. In two of these 
tests there have been several comparable samples on which reliability has been 
computed. In one case, this is lower than any of the others, viz., leg raising 
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TABLE 8 
Validity of Tests on Group A and B as Compared with the Young, Roloff and Scott 
Criteria 
Be ae 
Test Item } ee ae pa CoS = Scotta 
ee ee a 
! | 
Leg force _.... Os 2 te ESE 15 | RS | 
PR IE cascada ese eel —.01 O7 | 13 0 | 
Ree cree 00 
Push —_________-|_ — 03 —.07 —.01 —08 | .25 
Balence leap ...—-———-— tn 46 AZ | 31 17 | 
Denees OU one 24 
Weight shifting —...._________ ll | 
Bedy sway —__$_$______ .16 .00 09 33 | 43 
A ID sternceiiennecinininicccinrenaininergil 36, -—#2 —0% | 12 
Arm raising a scibcesaaied | 14 
Thigh raising eeiniaeeaneieed ee .00 
. eee een 02 04 
Finger spread | —.02 06 | —.07 13 | —08 
Lower leg flexion... Antica caeiel 13 ») 14 4. 
Wm ttn | =e —.11 
SD RII cinerea 02 Al | 18 ami 
Target pointing (pointer) ......__.__. 08 —05 | 14 09 | —.02 
Horizontal lines ——.._______ 19 05 | 23 02 
ae eee 08 10 | 20 10 | —.08 
TH WE. nn ceteinel 09 | 5 21 
| | 
WE atin cpasteriliccmniecomnnnens 04 .00 
Broad jump (specified) nnwe| —eO4 06 | 
Broad jump (maximum)-— 38 | 40 | 
Ber ARG so eee ee Al 36 
PASS CII ripecrsicetinciceeorcernsitecinlltil A4 14 | 47 49 | 
FI GI overs ecnpacterebcoccctaeee 40 .08 i) 








8 Scott criterion = 19 arm swinging + 8 balance leap — horizontal lines. 


.76 as compared with .93, .94, and .98. For arm raising the drop is probably 
to more nearly the true reliability to be expected—.81 as compared with .91, 
80 and .80 in other studies. In the other two tests there are only the two 
samples for comparison. 

The validity values on the three criteria are in Table 8. The coefficient was 
not computed if the test was a part of a battery. A few points seem to be 
apparent from study of data from various samples. 

1. Many of the tests agree fairly well from sample to sample and from one 
criterion to another in terms of level of relationship. 

2. Negative coefhicients are without exception insignificant. 

3. There is more agreement from one criterion to another within samples 
than between samples for each criterion. 
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4. The tests which correlate significantly with the criterion taken from 
Part A are push, balance stick, body sway, ball balance, triangle walk, and 
arm circling. This is a larger proportion of valid items than with either the 
Young or Roloff criterion for either this sample or the previous one. This 
may mean simply that the test array is more homogeneous and it may mean 
that the criterion is a more adequate one. 


TABLE 9 
Validity of Best Test Batteries 


Battery R 














COMPUTED ON DATA FROM Group A 
Young criterion: 
1) 7.0 arm swinging + 3.0 balance leap — .4 horizontal lines — 1.0 body 





COU ciitcnsirtningane 60 
2) 19.0 arm swinging > 8.0 balance leap — 1.0 horizontal lines _ wet 59 
3) 7.0 arm swinging + 2.5 balance leap — 1.0 os sway. aie ah 59 
4) 5.0 arm swinging + 2.0 balance leap... => 58 


Roloff criterion: 
1) 4.8 arm swinging + 1.0 balance leap — .4 walk and turn — .3 hori- 








zontal lines — .5 ball balance 58 
2) 16.0 arm swinging + 3.0 balance leap — 1.0 walk and 6 cure - — - 10 hori. 

zontal lines _ 58 
3) 16.0 arm swinging + 3.0 balance leap — 1.0 walk and Re eed 55 
4) 17.0 arm swinging — 1.0 walk and turn_ Scat ness daicpticicesee 51 


ComPuTED ON DATA FROM Group B 
Scott criterion: 

1) 4.0 arm circling — .9 push — 1.0 ball balance — .7 body sway_. 66 

2) 6.0 arm circling + .9 balance stick — .6 ball balance — 1.0 body a sway 63 

3) 100 — .8 triangle walk — 1.3 ball balance — 1.0 sheers sway. 55 

4) 6.0 arm circling — 1.0 body sway... 

5) 5.0 arm circling — 1.0 push____. 














Table 11 (p. 340) gives intercorrelations of tests for Group B. Of the 120 
correlations only 21 are significant. This may lend support to the idea of 
specificity of functions involved. Very few of the significant r’s are negative 
although there are many negative r’s in the table. The proportion of nega- 
tives is smaller than for the correlations obtained in Part A. It seems 
probable that many of the negative ones are the result of sampling variation. 


Multiple correlations were determined with various combinations of items 
found valid relative to the criterion from Part A. These appear in Table 9. 
The R’s on the shorter batteries are comparable to those correlated with the 
other criteria, but the longer batteries are slightly higher. 


HORS NB 





pur panne 
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Discussion and Observations 


Of practical significance, perhaps, is the fact that the tests involved in these 
new batteries are all more interesting to the subjects than were the Young 
items. This is a great help in getting full co-operation from the subjects. 

While these results are not as conclusive as one might hope for, there are 
certain things which seem to stand out from these data. 


1. A balance test shows up favorably in every study that has been made 
with a balance test included. Whether this means that the neural mechanism 
of balance is similar to that used in other test items or simply that acuity of 
labyrinthine function parallels proprioceptor function cannot be answered 
at this time. However, it seems advisable to put a balance test into any 
kinesthesis battery. The two used here have been consistently reliable and 
fairly valid. 

2. Arm swinging checks with both of these batteries as it did previously 
with a longer criterion of many test items. Its consistency is apparent. 
Analysis of the test seems to indicate two capacities involved, ‘.e., ability to 
perceive kinesthetic cues and to execute precise movement in plane, distance 
and timing. It might be considered a learning capacity, but on the other 
hand, it is more than perception of the problem. Subjects discussing the 
test afterward sometimes said they knew what they were trying to do, but 
they had not been able to execute the movements, becoming lost in the 
sequence, or were inaccurate in movement. Arm circling has similar char- 
acteristics, correlating .42 with arm swinging and having similar vanaity 
coefficients whenever we have used them. 


3. The ball balance and horizontal lines are arm manipulations and though 
they intercorrelate only .17, they might show up better comparatively if the 
trials were increased and reliabilities raised, especially the horizontal lines. 

4. The walk and turn, leg force, and body sway also need work toward 
improving the reliability, 

The correlations of motor ability and broad jump (which is a part of this 
MA battery) are reasonably high for each of the two criteria of kinesthesis 
and also with the balance tests. They approach significance on leg force, 
target pointing (with pointer), floor target, but appear to be inversely re- 
lated to finger spread. Correlations do not seem to be high enough to indicate 
that these are learned tasks. It can be assumed that it is some capacity re- 
lated both to learned skills and adjustments to new. motor acts.. At the 
present time it is hypothesized on the basis of experience in this area of testing 
that kinesthetic perception could be the common element. 

Actually, this question of relationship between kinesthetic functioning and 
motor achievement js the reason for all of our efforts pertaining to kinesthesis. 
Practical application is the ultimate objective in obtaining information of 
this type. It is hoped it will lead eventually to better understanding of the 
process of motor learning, of individual differences in motor learning, of 



































338 The Research Quarterly, Vol. 26, No. 3 
TABLE 10 
Intercorrelation of Kinesthesis Tests on Group A 
| 
Test Item 1 2 3 | 4 5 6 | 7 + y 10 
| 
- 

L. Leg force bod | | 
2, Arm force | .02 | | 
i PS a ee 
4, Push ...  J—12 | 19} 22 eo ae P 
5. Balance leap _...| 03 | .20 |—.03 |—.04 | | | 
6. Balance stick | 07| 05| 02|~21| .40 peat 
7. Weight shifting _| .06| .17 |—.02 |—.01 |—.04 .14 
8. Body sway (side)\—.13 | .01 |—.03 |—.04 | .12| .23 |—.01 
9. Leg raising _..__ —02 |—.00 |—.08 |—.05 | .18 |—.01 |—.06 | .14 
10. Thigh raising | 06 | .07 |~.01 |—.08 |—.08 | 07} .14| .14 |—.08 
11. Arm raising | .15 |—.02 |—.03 | .18| .00 |—00| 20 |—.15 |—.27 |—.11 
12. Wrist extension _.| .03 |—.09 |—.17 |—.10 | .12; .10 | .04 |—.12 |—.09 |—.09 
13. Finger spread _.|—.15 |—.14 |—.13 |—.07 | .06 —02 | .03 |—.10 |—.06 | .16 
14. Lower Leg flexion|—.12 |—.07 |—.05 |—.09 | 03 .11| .13} .03 | .12 |/—.10 
15. Wrist flexion _..| .08 —.00 |~.00 | .10 —.01 |—.13 |—.12 | 02 |—.05 |} .02 
16. Ball balance _...| .15 | .22 |~—02/| .17| .06/ .05/| .14} .04| .03 07 
17. Target pointing _| .17 |—.04 |—.06| .15 | .16/| .03 | .09 |—.12 |—.02 | .00 
18, Horizontal lines _|—.05 | .14| .07| .24| O01) .18| 03; 06; 05) .13 
19. Walk and turn_.|—.10 | .04/|—.02| .09| 03 | .12| .08 |—.04 | .03 |—.02 
20. Broad jump (spec)|—.25 | .06| 01} .02 |—.01 |--.10 | .20 |\—.04 |—.01 | .13 
21. Flood target —.|—.11 |~.09 |—.02 |\—.26 | .03 | .12 |—.04, .05 |—.01| .18 
22. Broad jump (max.)| .18 |—.01 |—.09 |—.10| .38| .50/| .09| .03 | .01 |—.08 
23. Motor ability | .08| .04 |~01/—.14) 43 | 50| 05| 04) .14 |—.01 
24. Arm swinging | .24/| .11 |—.08|—.06| .20/ .29;| 21; 05/ .13 | .03 
25. Arm circling —..|—,02 | .10; 03; .08; 29; .23/ 19] 02] .32 |—.05 


























new methods to employ this motor sense in the learning process, and the 
maturation of the capacity for meaningful perception. 

There are implications in these data which would appear to give direction 
to further studies. With the mounting evidence toward the indication that 
the kinesthetic sensation is a complex of many independent and specific 
capacities or component elements, certain steps are indicated. 

1. Analysis must be made to determine those components. 

2. Test items will need to be created for many or all of the components. 

3. Test batteries will doubtless be fairly large when developed ultimately 

in order to adequately understand the individual variations. 

4. Determination of relationship between kinesthetic acuity and motor 

earning will be more complex. Achievement in one activity may be 

related to certain elements and success in another activity more related 
to other aspects of kinesthesis. 


Conclusions 


In light of data obtained from these two groups of 100 and 70 cases re- 
spectively the following conclusions are drawn. 
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TABLE 10 (Continued) 









































11 12 13 14 15 16 17 18 19 20 | 21 22 | 23 | 24 25 
. SS LD 5D eee Sa IS cou pee a er gquasemeunen aces 
| 
04 
01 | .09 
—.06 09 —.02 } | 


—.06 | .11 |-.16 |-.02 
—.04 | .04 —.10 |-.03 | .03 
13 | 05 | 08 |-.04 | .09 | .08 | 
07 |-.01 | .11 |-.03 | .01 | .17 | .00 
00 | .03 | 06} .12 | O1 |~.13 | 04) .00 
01 '~.02 | .22 | .07 | 08 | 04 —.13 |-.04 |-.00 
—.04 | .08 | 03 | .12 |-.06 |-.05 —.08 | .01 |-.08 | .05 
03 | .16 |-.19 |~.01 | .11 | .09 | .18 | .06 |-.01 -.09 | .15 | 

—.03 | 09 |-.17} 05 | 07 | 07) 14] 07 | .02 |-.08 | .17| 84 

19 | .14 |-.07 |-.16 |-.04 | .19 |-.05 | .13 | .00| .06| .02| .26/| .30 

11 |-.14 |-.13 | 11 | 04] .29; 12] 23 10; .12 |-.07| .24/| 31 cad 



































1, Several tests constructed on the basis of face validity for kinesthesis 
yielded reasonably reliable results which are consistent from one sample to 
another. 


2. The tests in general show little inter-relationship. This would lead one 
at present to assume considerable specificity of function. 


3. There is no single test which has adequate validity compared with the 
criteria available to date. 


4. The level of validity of a test item depends upon the criterion being 
used. 


? 5. Several combinations of tests appear to have a validity value satisfac- 
tory for further use. 


6. The sensation of kinesthesis is made up of many elements or forms of 
response. There is little or no evidence that it might be a general capacity. 


7. Much more work needs to be done toward refinement of the measures. 
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Abstract 


This study was an experimental investigation of the emotional responses of boys 9 
through 15 years of age to highly organized competitive baseball. The subjects were 75 
boys who participated in Little League Baseball, 50 boys who were members of Middle 
League teams, and 80 boys who did not play on an organized team. 

The Galvanic Skin Response test was employed to measure the emotional changes of 
boys participating in league baseball and to compare the results of these tests with those 
of the same boys participating in softball competition in physical education classes. 
Insofar as the Galvanic Skin Response test can be taken to be a valid measure of 
emotional excitation of boys of this age level, the results of this study suggest that 
youngsters were no more stimulated by competition in league games than they were by 
competition in physical education games. 


THROUGHOUT THE NATION there is considerable interest in the promo- 
tion of competitive recreational activities for boys 15 years of age and 
younger and much has been said and written both pro and con in regard to 
the advisability of highly competitive activities for boys below the ninth 
grade. 

Critics argue that there are harmful aspects of Little-League-type baseball, 
one of which is the emotional over-stimulation of boys which results from 
such factors as the presence of large numbers of spectators at games and 
undue emphasis on winning games. On the other hand, advocates of highly 
organized activities believe this type of experience to be more beneficial than 
harmful to youngsters. 


Purpose of the Study 


It appears that the controversy will continue to rage until evidence other 
than statements based on opinions is produced. It was the purpose of this 
study, therefore, to obtain by means of the Galvanic Skin Response test, 
scientific data regarding the emotional responsés of boys in a variety of 
competitive situations. More specifically, it was the purpose of this experi- 
ment: 1. to test the emotional responses of boys before and after participa- 
tion in Little and Middle League games and to compare these results with 
similar tests on the same boys before and after competition in physical edu- 


1This study was made in partial fulfillment of the Ph.D. requirements at the University 
of Southern California, Los Angeles, California. 
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cation classes; 2. to compare the emotional response of league players with 
that of a group of non-players; 3. to determine whether there are dif- 
ferences in the emotional responses of boys at various ages; 4. to determine 
if there are differences in the emotional responses of boys to various in- 
tensities of competition; and 5. to determine the emotional reaction of boys 
to winning and losing games. 


a Terminology 


Little League Baseball, Inc.—A national organization which promotes 
organized baseball for boys between the ages of 8 and 12. 

Middle League Baseball, Inc.—A league which originated in Santa Maria, 
California, and promotes organized baseball for boys 12 through 15 years 
of age. 

League Players—Subjects who were members of Santa Maria Little or 
Middle League teams. 

Non-Players—Subjects who were not members of league teams either be- 
cause they did not choose to belong or because they did not possess adequate 
skills to be chosen on teams. All boys who volunteered to take the tests in 
the schools where the experiment was conducted were included as subjects. 

Galvanic Skin Response Test—A measure of changes im sweat gland 
activity. The G.S.R. affords an indication of the activity of the autonomic 
nervous system which is of major importance in the bodily changes associated 
with emotional states. 

Emotional Response—The emotional change of a subject from one time 
to another time. 

Emotional State—The emotional condition of a subject at one particular 
time. 

Resting Level—A condition in which the subject is presumed to be under 
no particular stress. 


Review of Literature 


The desirability of highly organized athletics for boys below the ninth 
grade has become a major controversial issue. Although the elementary 
schools have for many years felt pressure to adopt the characteristics of the 
interscholastic program, the problem has just recently been brought into 
focus by the fabulous spread of Little League Baseball throughout the nation. 
The program is popular, and in 1954, over 70,000 boys participated on Little 
League teams. The pros snd cons of the effects of such competition on boys 
has been discussed, and a search of the literature shows that a lack of agree- 
ment exists between educators, coaches, child specialists, and sportswriters in 
regard to policies and practices, advantages and disadvantages of such partici- 
pation. 

Those who favor Little League activities such as Ridinger (36), Hale 
(13), Zorn (34), Aromandi (4), and Thees (14), believe that the movement 
was born out of the needs and desires of young boys and that there are 
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tremendous possibilities for youngsters to develop personal security and 
emotional stability. Questionnaire studies done by the National Recreation 
Association (32), the Fresno Recreation Department (15), and Phebe Scott 
(37) indicate that the majority of parents whose boys play league baseball 
and many recreation departments throughout the country approve Little 
League Baseball for boys. 

On the other hand, there are those who feel that intense competition is not 
only educationally unsound, but also detrimental to the physical, psychologi- 
cal, and emotional well-being of young boys. Lowman (24, 25), McNeely 
(26), Morris (29), Keene (23), Morehouse and Miller (28) are a few who 
cite evidence to indicate that boys may be harmed physically by such activi- 
ties. Jersild (16), Fait (10), Shaffer (38), and Solomon (39) have written 
of the psychological effects which may result from improperly conducted 
competition, and Montgomery (27), Solomon (39), and Bucher (5) are a 
few who have written of emotional harm from intense competition. 

Many groups, through their national organizations, have passed resolu- 
tions against highly organized programs. The North Carolina Recreation 
Commission (35), the National Committee on Cooperation in Health Educa- 
tion (30), the National Federation of High School Athletic Associations 
(31), the National Recreation Association (33), the American Association 
for Health, Physical Education, and Recreation (1), and the National 
Conference on Program Planning in Games and Sports for Boys and Girls 
of Elementary School Age (2) represent many members who believe a broad, 
well-conducted intramural-type program would better serve the needs of boys 
than an interscholastic-type program. 

The literature revealed very few experimental studies that attempted to 
determine the effects of competition on emotion. Johnson (19) used a 
psychogelvanometer in combination with word association tests in order to 
study pre-competitive emotional behavior of college athletes and found that 
a pre-contest stress situation will provoke greater emotional responses to 
critical words in sports than the undisturbed condition. In another study, 
Johnson (18) reported that many sports champions report excruciating 
emotional distress before a contest even though they appear calm. 

. The analysis of the literature strongly suggests the need for objective evi- 
dence on all phases of competition for youngsters. Since no studies were 
found which tested the emotions of boys in the 9-to-15-year age group, this 
investigation was an attempt to meet this need by studying one phase of the 
problem—the immediate emotional responses of young boys to competition. 


Limitations of the Study 


When an experiment is conducted in the midst of real life situations rathe: 
than in the laboratory situation, precise controls are not always possible. In 
the present study, most of the testing took place in school buildings and on 
the play fields. It was, of course, impossible to eliminate the normal noises, 


the ringing of bells, and the interruptions that are part of the school or play- 
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ground situation. Also, it was difficult to control such factors as the sub- 
ject’s time of eating, the amount of exercise taken prior to the tests, and 
the possibility of extraneous emotional stimuli. However, a large number 
of subjects were involved and the small errors caused by lack of precise con- 
trols were probably minimized. 

Another serious problem involved the study of emotion itself. Emotions 
are extremely complex and neither the precise function nor the exact nature 
of the emotion is known. In studies such as the present one, emotion must be 
dealt with indirectly by way of physiological indicators which are known 
to vary more or less systematically with certain psychological states. Plainly, 
research findings based upon the physiology of emotion must be interpreted 
with care. 


Research Procedures 


The Galvanic Skin Response Test. When the emotional state of an indi- 
vidual changes, his body reflects this change in many ways. The most obvious 
indicators are the respiratory rate, muscular tension, pulse rate, and blood 
pressure. However, many other bodily changes are associated with emotional 
states and one of the most significant is the resistance of the skin to the 
flow of electric current. During some emotional states, the sweat glands of 
the palms become active and offer greater resistance to a current passed 
through the skin. This resistance can be measured and is called the galvanic 
skin response (12). It should be noted that neither galvanic skin response 
nor any other physiological indicator measures emotion per se. Some physio- 
logical states can be measured to ascertain the extent of certain bodily changes 
but these are merely suggestions of concomitant emotional excitation. 

In the present study, the instrument used was a sensitive galvanometer-type 
instrument. Attached to this were zinc-plated electrodes of the type used by 
the medical profession. Microhm jelly was used to insure contact between the 
electrode and the skin. 

The Subjects. The subjects were 206 boys ranging in age from 9 through 
15 who lived in Santa Maria, California, and adjacent towns. Of this total, 
75 boys were 9 to 12 years of age and were members of Little League teams 
during the summer of 1953. Fifty-one boys, ranging in age from 12 through 
15, played on Middle League teams, A group of 80 non-players, 9 through 
15 years of age, were chosen at random and used as a control group.” 

Administration of the Tests. Each subject to be tested was seated in a chair 
with his arms resting comfortably on a table. The palm of the right hand and 
a spot on the lower left arm one inch below the crease of the elbow were 
cleansed with alcohol and rubbed briskly with electrode jelly. The electrodes 
were applied to the palm and the arm and the reading recorded in ohms. 





2For their wholehearted co-operation in facilitating the collection of data, special 
gratitude is extended to William Ellis, founder of Middle League, Inc., the coaches of 
Little League and Middle League teams, the principals and teachers of the Santa Maria 
schools, and all the boys who participated in the study. 
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The following tests were administered: 

1. All subjects were tested immediately before softball competition in 
physical education classes, immediately after their class and 11% hours 
after class. 

2. The subjects who played on Little or Middle League teams were tested 
immediately before competition in baseball, immediately after the game was 
over, and 11% hours after the game. This procedure was repeated at three 
of their games. 

3. All subjects, players and non-players, were tested while in the most 
relaxed state possible and while under no known stress. Fifty of these 
subjects were retested in approximately ten minutes in order to check the 
reliability of the test. 


Analysis of the Data : 

Reliability. The correlation of the two resting level galvanic skin response 
scores for 50 subjects was .80 with a p.e. of .03. This finding is consistent 
with the high reliabilities found by other workers such as Elbel and Ronkin 
(10), Jones (23), Grings (13), Croslans and Beck (8), and Johnson (20). 

Resting Level Scores. The range of galvanic skin response scores for all 
subjects while at rest was 6,700 to 140,000 ohms. The population mean was 
17,400 ohms with a standard deviation of 10,700 ohms. Analysis of variance 
indicated that there was variability in scores from one individual to another. 

Table 1 shows the means for each age group. Table 2 shows a significant 
difference in scores between age groups at the 5 per cent level of confidence. 
The average resistance of individuals at different ages seems to be different. 






































TABLE 1 
Resting Level Means for Each Age 
i ne Ar Shae | ‘ . P eee rr ae 
Age No. Cases Mean in Ohms | Deviation 
9 | 19 1.60 67 
10 37 1.46 83 
ll 38 1,59 .76 
12 31 1.73 91 
13 29 2.73 3.06 
14 30 1.65 67 
15 26 1.52 58 
Total ee oe 
TABLE 2 
Analysis of Variance for Resting Level of All Subjects 
™j ~ gum of — | peo ee | : ne 
Source of Variance Squares D. F. [ Var. Est. F “oh P 
Bet. age groups _. 33.2 6 5.53 3.30 05 


Within groups....| 34165 | 203 | 168 | _ bia 
_* poet | 374,85 | 209 | 
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In general, the youngest boys tended to show the greatest skin conductivity 
and this decreased with age. The 13-year-old boys showed the least emotional 
response and, although the reason for this is not clear, it is possible that 
this phenomenon is in some way related to maturation and puberty. 

The resting scores of those boys who played on league teams were com- 
pared to scores of non-players. In general, the players at all ages evidenced 
greater skin resistance (less change from the resting level) than the non- 
players at all ages. See Table 3. 


























TABLE 3 
Differences Between Players and Non-Players for Resting Level 
~ Source of Variance 1 No, Cases | Mean | ee Se er a Be oie 
Age 9 
Players 9 1.73 18 +-.2089 -- 
Non-players __ 9 1.64 .22 
Age 10 | 
Players me 21 1.74 34 +6.86 01 
Non-players 17 1.35 | 035 
Age 11 
Players | 28 an +324 | Ol 
Non-players 11 1.13 Ay b. 
Age 12 | | 
Players . | 23 1.98 24 | +3.71 01 
Non-players 8 1.00 | 081 | 
Age 13 
Players ele 16 2.72 ' 58 = wi 
Non-players 13 2.75 1.10 
Age 14 
Players | 19 1.97 | 13 +420 | Ol 
Non-players 11 1.12 15 | 
Age 15 | | 
Players 16 1.88 12 +6,22 01 
Non-players 10 .96 08 


G.S.R. Scores Relating to Physical Education. Analysis of variance was 
computed for Little and Middle League players for the three physical educa- 
tion tests (immediately before and after class and 11% hours after class), 
and it was found that significant differences at the 1-per-cent level of confi- 
dence existed among the three tests. 

For the non-players, there were significant differences between the test for 
boys aged 9, 10, and 11 but not for those 12 through 15. 

In comparing the players with the non-players on the three tests, the Fisher 
t scores showed in general that players tend to show less emotionality than 
the non-players. See Table 4. 

G.S.R. Scores Relating to League Games. Seventy-five subjects who played 
on teams were tested three times on the day of a league game (immediately 
before and after the game and 11% hours after the game). Analysis of 
variance showed that for almost every age there were large differences in 
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TABLE 4 
Difference Between Players and Non-Players on Three Physical Education Tests 





Test ~~ Probability 











Test 1—before physical , 01 Players more relaxed 
education 





Test 2—After physical 3. el Players more relaxed 
education F — 








.05 Players more relaxed 





Test 3—1% hours after J _ 
physical education é — 
01 Players more relaxed 
.01 Players more relaxed 
.01 Players more relaxed 
.01 Players more relaxed 











3.83 





scores between the three tests and significant differences between the scores 
of individual players. See Table 5. 

Comparison of Physical Education Scores with League Scores. There ap- 
peared to be a general pattern of emotional response for the four tests taken 
on one day either for physical education competition or league competition. 
The resting level tests, as would be expected, showed the most relaxed state 
of the players and non-players. Just before participation in a game, league 
players were in an excited state which generally increased right after the 
game was over. On the third test 114 hours after the completion of the game, 
the players consistently reached a higher degree of relaxation than they had 
before the physical education and league games. However, this level of 
relaxation was still far below the resting level scores. 

With the exception of the nine-year-olds, the non-players were considerably 
more excited 14% hours after their physical education games than they were 
before the start of the games. 

When emotional reactions of subjects before a physical education game 
were compared with those for league games, no significant differences were 
found. The players showed no greater skin response in anticipation of a 
league game than they did before competition in physical education classes. 

Of the tests taken immediately after the games, the 10-, 11-, and 13-year- 
old players showed more skin response after physical education competition 
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TABLE 5 


Analysis of Variance for the Game Situation for Players 
(Average of Three Games) 





Sum of 
Source of . ariance Mean Squares D. F. Var. est. 

Age 9 (9 subjects) 
Between tests 1.50 1.04 34 
Between players —_. 4.15 51 
Residual _____. 2.05 85 
Age 10 (21 subjects) 
Between tests A 2.05 68 
Between players —__._ 27.86 1.39 
pO ee 19,37 31 
Age 11 (28 subjects) 
Between tests ‘ 4.70 1.56 
Between players _. 37.53 1.32 
Residual ote ee! 35.70 44 
Age 12 (23 subjects) 
Between tests - ‘ 7.22 
Between players __ 26.29 
Residual 26.98 
Age 13 (16 subjects) 
Between tests __ ’ 26.07 
Between players - 30.64 
Residual _... 55.34 
Age 14 (19 subjects) 
Between tests - : 9,52 
Between players _ 7.00 
Residual - 4.18 
Age 15 (16 subjects) 
Between tests ; 6.64 
Between players 7.52 
Residual 8.81 


























than they did after a league game. Also, the 10-, 11-, and 12-year-old players 
showed more skin response 114 hours after their league games. At all other 
ages, there were no significant differences. See Table 6. 

Emotional Reactions to Winning and Losing Games. Galvanic skin re- 
sponse scores of players taken immediately after losing league games were 
compared to the scores taken immediately after winning games, and the 
results as shown in Table 7 indicated that players show greater skin response 
after winning than after losing a game. 

Comparison of G.S.R. Scores for a League Game, a Physical Education 
Game and a Championship Game. There appears to be little difference in 
the galvanic skin response of boys to different intensities of competition. 
The only significant difference, at the 1-per cent level of confidence, was the 
greater excitability of Little League players before a semi-final state cham- 
pionship game as compared to a quarter final game. 
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TABLE 6 


Analysis of Variance for Players in League Games and Physical Education Games 








Source of Variance t 


Age 9 
Before a physical education game—before a league game ll 
After a physical education game—after a league game ~ ; 1.36 
1% hours after a phys. educ. game—1% hours after a league game 74 

Age 10 
Before a physical education game—before a league game 7 
After a physical education game—after a league game 1,90 
1% hours after a phys. educ. game—11% hours after a league game} 2.50 

Age 11 
Before a physical education game—before a league game 1.49 
After a physical education game—after a league game 2.74 

14 hours after a phys. educ. game—1'% hours after a league game} ‘1.84 

Age 12 
Before a physical education game—before a league game 1.49 
After a physical education game—after a league game 151 
1% hours after a phys. educ. game—11 hours after a league 2.83 

Age 13 
Before a physical education game—bhefore a league game 1.03 
After a physical education game—after a league game 2.33 
1% hours after a phys. educ. game—1%4 hours after a league game 39 

Age 14 
Before a physical education game—before a league game Raf 
After a physical education game—afier a league game 51 
14 hours after a phys. educ. game—1%% hours after a league game 7 

Age 15 
Before a physical education game—before a league game 1.59 
After a physical education game—after a league game .08 

1% hours after a phys. educ. game—1% hours after a league game, 1.22 











TABLE 7 


Significance of the Difference Between Winning and Losing a League Game 





lied Bite is Beas NG an te Oe ees 
No. Cases No. Cases t | 


| 
132 
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Summary and Conclusions 


The present experiment was designed to collect objective data regarding 
the effects of competition on the emotions of boys between the ages of 9 and 
15. The galvanic skin response test was employed to measure emotional 
changes of boys who were participating in Little League and Middle League 
Baseball and to compare the results of these tests with those of the same 
boys participating in softball competition in physical education classes. 

Many critics of Little League Baseball have stated that highly organized 
competition places physical and emotional strains on boys which may be 
harmful to their general welfare. Insofar as the G.S.R. can be taken to be a 
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valid measure of the emotional excitation of boys of this age level, the 
results of the present study suggest. that youngsters were no more stimulated 
by competition in league games than they were by competition in physical 
education games. 

It should be remembered that this study was concerned with only one 
phase of the total problem of competition—the immediate effects of competi- 
tion on emotionality. In order to completely solve the problem of highly 
organized competition, data must be gathered relative to the physical, socio- 
logical, psychological, and economic aspects of competition. Furthermore, to 
resolve the specific controversy concerning the emotional effects of competi- 
tion, it is necessary that additional data be secured, particularly in regard 
to the influence of emotion on personality now and later in life. 

Within the limits of this study of boys 9 through 15 years of age who 
played Little League and Middle League competitive baseball in Santa 
Maria, California, and within the limits of information about emotion as 
derived from the galvanic skin response test, the evidence presented supports 
the conclusion that Little-League-type competition is no more stimulating 
than competition in school physical education. 
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Game Results as a Criterion for 
Validating Basketball Skill Test 


FRANCIS STROUP 
Southern State College 
Magnolia, Arkansas 


Abstract 

In this study the scores of the competing teams in 31 ten-minute basketball games are 
compared with the skill score averages of the teams. The skill score averages were com- 
puted from scores on passing, dribbling and shooting tests. Approximately 84 per cent 
of the games were won by the team with the higher skill score average. 


THE SELECTION of a suitable criterion for establishing test validity is 
a major problem in test construction. Attempts to express numerical co- 
efficients of validity have stimulated the use of a variety of such criteria. 
The criteria used in validating physical education performance tests have 
been predominantly judgment ratings made by qualified observers (1, 5, 6, 7). 
However, other criteria have been used. 

Dyer used relative final positions of subjects in a round robin tournament 
as the criterion for establishing the validity of a tennis test (4). Criteria 
based on game results have also been used by Lockhart and McPherson (9) 
in badminton, by Miller (11) in badminton, by Lockhart and Mott (10) in 
table tennis and by Cornish (3) in handball. 

But the use of game results as a criterion in validating tests has been 
limited largely to individual and dual sports. This limitation is attributable 
to the relative ease of controlling factors in games involving one or two 
performers. 


The Problem 

Two related purposes provided the problem for investigation in this study. 
They were: (1) to demonstrate the use of a validation technique for a team 
sport test in which game results were used as the criterion, and (2) to estab- 
lish the validity of an administratively economical test for equating teams. 

As a means of improved validation. The demonstration of a technique in 
which game results are used as a criterion for validation would be an im- 
portant step in attaining greater confidence in sport skill tests. Regardless 
of the magnitude of the derived coefficient of correlation between scores on a 
test and a selected validity criterion, the test will not inspire confidence if 
there is doubt regarding the appropriateness of the criterion. And confidence 
is necessary if a test is to receive wide use. 

No criterion for determining the validity of a sport test seems more 
appropriate than the results of contests in that sport. If it is possible to show 
that such a criterion can be suitably employed, tests validated by the use of 
such a criterion should inspire greater confidence and consequently should 
receive wider use. 
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Demonstration of the use of such a criterion could perform a second func- 
tion. If game results are used successfully as a criterion for validating a 
basketball test, the demonstration of that use should suggest ways of using 
a similar criterion for validating tests in other team sports. Thus, further 
research and consequent improvement in physical education sports testing 
might be stimulated. 

As a means of equating teams. An easily administered test for equating 
basketball teams has many potential uses. Most of the benefits claimed for 
sports participation are lost unless a high level of interest by the participant is 
maintained, Grossly unequal competition is a great deterrent to such interest. 
A simple test for reducing the inequality of teams would be useful, not only in 
school classes and intramural programs, but also in other institutions such 
as vacation camps and military installations where participants are grouped 
for competition on short acquaintance. 

The achievement of these two purposes seemed sufficiently important to 
justify this study. 


Procedure 

In order to achieve these purposes, a study was made using men enrolled 
in four sections of Freshman and Sophomore physical education classes at 
Southern State College as subjects. 

The test items. Since the use of an administratively economical test was 
one of the purposes of this study, it was decided to use a three-item test. 
Most investigators of basketball skill tests have recommended a battery of 
five items or less (7, 8, 12). Glassow, Colvin, and Schwarz indicated that the 
fourth and fifth items investigated by them made no contribution to the 
validity of the battery (6). The three items selected for use in this study 
were goal shooting, wall passing, and dribbling. 

In the goal-shooting test, the subject was allowed to stand as near as he 
wished to the basket and shoot as many baskets as possible in one minute, 
retrieving the ball each time himself. No penalties were invoked for rules 
violations since the time limit penalized and discouraged such infractions, 

In the wall-passing test, the subject stood behind a line six feet from a wall 
and passed against the wall as many times as possible in one minute. It was 
considered a miss to bat the ball instead of catch it or to move beyond the 
restraining line while handling the ball. The score was the number of legal 
passes made in one minute. 

The dribbling test required the subject to dribble alternately to the left 
and right of bottles placed in line and 15-ft. apart on a 90-ft. court, circle 
the end bottle, and continue in this manner for one minute. It was considered 
a miss to knock over a bottle or to not pass a bottle on the proper side. The 
score was the number of bottles properly passed in one minute. 

Acceptable coefficients of reliability and validity have been reported for 
similar or identical test items by other investigators (7, 8, 12, 6). The 
criterion for validity in these studies was either subjective ratings or scores 
on a larger battery of tests. 
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Scoring procedures. Previous experimentation with the test items used in 
this study has led to the development of scale scores with letter equivalents 
for performances on each item. These scale scores were based on a theoretical 
division of the population of the normal curve. The scale scores, letter equiva- 
lents and division of the population are indicated in Table 1. A subject’s raw 
scores for the three items were converted to scale scores which were then 
averaged to obtain his basketball skill score. A table used for this conversion 
is shown in the Appendix, page 357, 


TABLE 1 
Corresponding Scale Scores, Letter Grades and Per Cent of Population of Normal 
Distribution 
Scale Score | Letter Per Cent 

91-100 A 7 

81- 90 B 24 

71- 80 C 38 

61- 70 D 24 

F 7 





As a part of this study, ten-minute basketball games were played in four 
team sport sections of physical education classes for men. These games 
were spaced over the last six weeks of the semester and team members for each 
game were selected at random. A scorekeeper kept the lineups and scores. 

At the close of the semester, the three-item basketball test was administered 
to class members. Skill scores for the subjects were computed as previously 
described and each subject’s skill score was inserted before his name on the 
score sheet of each game in which he participated. 

Skill scores for the members of each team were averaged and comparisons 
of competing teams were based on average skill scores and game scores. 


Results 

Forty-one ten-minute games were played during the study. The three-item 
basketball test was administered to 121 subjects. Subjects’ skill scores derived 
from test performances ranged from 52 to 84 with a mean of 72.65. 

Owing to drop-outs and injuries, all men participating in the games did 
not take the test. Skill scores were available for all members of 75 teams and 
averages for these teams ranged from 65.2 to 78.6 with a mean of 73.31. 

There were 38 games for which average skill scores were available for both 
competing teams. In one of these, the teams had identical skill score averages 
and played to a tie game. In six games, the competing teams had equal 
average skill scores or equal game scores. There was a difference in both 
skill scores and game scores for the competing teams in 31 games. 

The average skill score differences for these thirty-one games ranged from 
0.4 to 10.2 with a mean of 3.45. The game score differences ranged from 
2 to 18, with a mean difference of 7.36. 
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Of the 31 games in which differences both between average skill scores and 
between game scores for the competing teams were observed, 26 (83.87%) 
of the games were won by the team having the higher average skill score. The 
five games with outcomes counter to this pattern had average skill score 
differences of 0.8, 1.0, 2.2, 4.6, 6.6 and respective game score differences of 
6, 6, 4, 4, and 6. In no case did a team having an average skill score ad- 
vantage of more than 6.6 lose a game. 

Summary 

This study was designed: 1. to demonstrate the use of a validation tech- 
nique for a team sport test in which game results were used as the criterion, 
and 2. to establish the validity of an administratively economical test suitable 
for equating basketball teams. 

To achieve these objectives, members of four physical education classes 
played ten-minute games during basketball participation periods. At the 
close of the semester a basketball test consisting of passing, shooting, and 
dribbling was administered to them. 

Raw scores on the test items were converted to scale scores which were 
then averaged to find each subject’s basketball skill score. This skill score 
was inserted before each subject’s name on the score sheet of games in which 
he had participated. By this method, average skill scores of competing teams 
could be compared with the scores resulting from the games. 

For instances in which there were differences both in average skill scores 
and in game scores for the two competing teams, 83.87 per cent of the games 
were won by the team having the higher skill score. In cases not complying 
with this pattern the differences between average skill scores and the dif- 
ferences between game scores were relatively small. 


Conclusions 

From the results of the study, these conclusions were drawn: 

1. The procedure employed in this study demonstrates the use of ten-minute 
game results as a criterion for validating a team sport test. 

2. Average skill scores derived from scores on the three-item test are a 
valid measure of team strength in basketball because relative skill scores of 
competing teams was related to the ability to win ten-minute games. : 

3. The test appears to be a practical instrument for equating teams, because 
knowledge of average skill score differences for competing teams in games 
not involving ties would have allowed correct prediction of the outcome of 
approximately 84 per cent of these games. In the remaining 16 per cent, dif- 
ferences in average skill scores and game scores were usually small. 
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Emotion in Fitness Tests 


Prudential Insurance Company of America 
Newark, New Jersey 


Abstract 
The paper presents a method of estimating the emotional factor in flarimeter tests on 
athletes, by analyzing an index of responses to standard stimulus. 


FITNESS TESTS have failed to meet expectations. In studying flarimeter 
tests' on athletes it has become evident that the real reason for this is that 
such tests are disturbed by emotion. This paper presents a method of esti- 
mating the emotional factor. 


Standard Stimulus 

Standard stimulus for athletes (4) consisted, first, of Master’s standard 
step exercise for 90 seconds. The subject was then seated with dispatch, 
began a flarimeter blow 10 seconds after exercise which was stopped after 
blowing 30 seconds. The subject then sat still until the final blow was begun 
at 2144 minutes after exercise, which was held until the systolic blood pres- 
sure rose 30 mm. 

Systolic blood pressure was read at 1 minute after exercise. Let S,; repre- 
sent its value in mm of mercury. Diastolic blood pressure was taken im- 
mediately after the systolic. Call this D,;. Similarly, S, and Dz were the 
readings at 2 minutes after exercise, and R’ the pulse rate per minute, from 
a 15-sec. count begun at 114 minutes after exercise. 

De was usually equal to the diastolic at rest before exercise, indeed D, was 
seldom appreciably higher, and the pulse rate R’ was usually slightly below 
its pre-exercise value. Most of the response was in S;. It was soon discovered 
that the reason for this was the powerful vagus action of holding the breath 
in the first blow after exercise. Continuous electrocardiographic records of 
Lead II showed sudden vagus slowing whenever the heart accelerated enough, 
most often toward the end of the blow but sometimes early in the blow or after 
it stopped. 


Flow Index 

To avoid bias, the responses to standard stimulus were studied separately 
for over 15 years, in spite of the obvious tendency of the systolic pressure to 
vary reciprocally with the pulse rate. Finally, the five measures (S;, D,, R’, 
Se and D2) were combined into an index, F’ = P’R’/100, where P’ = 
(P; + Pe)/2 and P; = S; — D,, P2 = Se —De. This seemed the least 
empirical formula to use. It was not intended as a law of physiology, al- 


1The flarimeter is a clinical instrument for testing circulatory fitness (2, 3). The sub- 
ject blows through a standard orifice at a standard pressure. The examiner can follow 
the systolic blood pressure during the blow, to measure the circulatory response. 
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though the “pulse product” has been studied seriously as a measure of blood 
flow ever since its introduction by Erlanger and Hooker in 1904 (5). In view 
of this, it seems permissible to call F’ = P’R’/100 a “flow index,” in arbitrary 
units. Its median value for Princeton University athletes was about 30 after 
standard stimulus, while the median value of F = PR/100 sitting at rest 
before exercise was about 25, where P = S — D was the pulse pressure and 
R the pulse rate.’ 

The flow index is much better than blood pressure or heart rate as a 
measure of the response to standard stimulus. Momentary changes in pulse 
pressure or rate are often erratic when their product, the flow index, is 
steady. To determine the relation of F’ to F, individual medians of 83 tests 
on 31 Princeton varsity swimmers at peaks of training (selected out of 226 
tests on them in all conditions) were plotted in a scatter diagram, shown in 
Fig. I. A least-squares fit gave the regression line F’ = 1.2F, with a correla- 
tion r == 0.87, but the fit of line F’ = F +- 5 is quite as good, as shown by 
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Ficure I. Scatter Diagram of the Flow Index. 
Abscissas — F, the index before exercise. 
Ordinates — F’, the index after standard stimulus. 

(Circles represent second points identical in values with dots inside) 
2Since P’ is the average of P; and P2, it approximates the pulse pressure simultaneous 
with the pulse rate R’. So the product F’ has some physiologic meaning. It is not merely 
a statistical index. Physiologists point out (1) that F’/F measures relative blood flow 
only when arterial elasticity does not change, but the fact that the diastolic blood pres- 
sure after standard stimulus is usually nearly the same as before exercise is presumptive 
evidence that F’ is a fairly good crude index of blood flow under the conditions of the 
test, 








360 The Research Quarterly, Vol. 26, No. 3 


Table 1, and this line is more practical. A summary of its agreement with 
individual medians is shown in Table 2. These tables give the average dif- 
ferences between observed and calculated values of F’. Disregarding sign the 
average deviations are as shown in the columns headed (= a.d.). 

Notice that seven of the 31 points in Fig. I actually fall on the line 
F’ = F + 5. It is quite obvious that standard stimulus produces an almost 
constant increment of five units in the flow index. However, there is a 
tendency for the increment to increase with the index. Out of 204 individual 
medians, 92 with F’ less than 30 give a median increment of three, while 


TABLE 1 
Comparison of Fit of Formulas 
Fo =F +5andF’=12F 
to Individual Medians on 80 Varsity Swimmers 
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54 with F’ 35 or over give a median increment of eight units. The same 
excitability which produces above-average flow index also increases the 
response to standard stimulus. The expected increment in the index required 
to clean up the oxygen debt, therefore, is about five units or 20 per cent. 

To make sure there was no bias in the medians, the formula was applied 
to unselected single tests. The 200 latest tests on swimmers, the 100 latest 
on runners, all the tests on the 1937 basketball team and two varsity crews 
were taken, with results shown in Table 3. The fact that the formula fits 


TABLE 3 
Fit of Formula F’ = F +-5 to 518 Unselected Tests 

















Medians 
Men Kinietes teis [| COberdaie) | 
92 Swimmers 200 27 32 .00 4.6 
45 Track 100 25 30 —.25 3.7 
10 Bk 1937 73 27 31 —.14 3.7 
29 Crew 1948 47 29 32 —2.00 4.5 
12 | Crew 1936 | 98 | 30 34 —54 3.5 
188 518 —.35 4.1 




















tests taken at random suggests that the deviations from it are largely due to 
emotion. 


A male college athlete sitting at rest should have a flow index less than 
25 and it should not rise above 30 after standard stimulus. Yet more than 
half of them do, probably because of emotion. In fact, the flow index is 
often observed to be twice the expected, and it may fall, not rise, after stand- 
ard stimulus. It is perfectly evident that there is no physiologic need for such 
erratic changes. They may be due to vasomotor reactions which themselves 
are largely due to emotion. Some champions have shown no such vagaries in 
tests over their whole college careers. 

The effect of emotion can be differentiated from the physical changes. Thus 
a fall in the index after standard stimulus, when a rise of five units is ex- 
pected, can mean only that the heart was working quite unnecessarily at rest 
before exercise, and calmed down, as the result of the stimulus, more than 
five units. A high F to begin with reinforces the argument. On the other 
hand, a flow index F = 20 or less which increases more than five after 
standard stimulus shows that the subject was truly at rest before exercise, 
but the stimulus produced an excessive response. This may be merely 
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emotional. To prove it requires success in dispelling the emotion, which is 
not always easy. The relation of emotion to impairment is still further 
involved. 


Discussion 


A very real oxygen debt is incurred in athletic events. Shortly after a 
workout, however, it is not unusual to observe a flow index of 20 or less, 
long before the oxygen debt has been cleaned up. Apparently, the circulation 
is slow in working it off. So the immediate response to exercise is largely 
anticipatory in nature. Indeed, the mere thought of a contest is enough to 
set the heart racing. As soon as the exercise ceases, the threat to the athlete 
is removed and he relaxes to a comfortable flow index. Thus, performance 
tests are inextricably involved with emotion. 

Standard stimulus has been very carefully designed to avoid emotional 
stimulus. If the same number of ascents is performed in less than 90 seconds, 
the responses are greater. Any novelty, such as attaching electrocardiographic 
leads, produces an emotional reaction even in those complacent to the 
standard technique. Ordinarily, the flow index (F’) decreases from test to 
test, as familiarity lessens emotion. The fact that much can be done to induce 
an athlete to relax is perhaps the best evidence that the vagaries in circula- 
tory response are largely due to emotion. 

To illustrate by examples requires too much detail; but an experience of 20 
years testing 426 athletes repeatedly, many through their whole college 
careers and some even longer, has convinced the author that the flow index 
is the best practical way to discount the emotional factor in flarimeter tests.* 
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Professional Courses in the Prevention and Treatment 
of Athletic Injuries at American 
Universities and Colleges 


LANE B. BLANK 
Franklin Junior High School 
Long Beach, California 


THE PURPOSE OF this study was to examine and analyze the course offerings in the 
prevention and treatment of athletic injuries for the preparation of male professional 
personnel in physical education at American universities and colleges. 

The first portion of the study consisted of the examination of the catalogs of all 
American institutions of higher learning which offer professional education in the fields 
of health education, physical education, and recreation. Of the 419 schools investigated 
222 were discovered to offer the type of course in question, and 13 of the latter were 
found to have two such courses. 

The principal technique of investigation consisted of mailing questionnaires concerning 
specific courses and their characteristics to each of the aforementioned 222 institutions. 
One hundred twenty responses (approximately 54 per cent) were received, thus consti- 
tuting a fairly adequate degree of response (1). 

The findings of the present study are summarized as follows:! 

1. More than one-half of the courses in athletic injuries are classified at the level of 
the third-year college student. 

2. The following titles account for more than 60 per cent of the courses investigated: 
“prevention and treatment of athletic injuries,” “athletic training and conditioning,” and 
“first aid and care of athletic injuries.” 

3. The most common prerequisites for courses in athletic injuries appear to be (in 
order) : anatomy, physiology, first aid, kinesiology, and physiology of exercise. 

4. A course in athletic injuries is required for the attainment of the Bachelor’s degree 
or teaching credential in physical education at the majority of institutions. 

5. Nearly 70 per cent of the courses were reported to be benefited by the appearance 
of guest instructional personnel, particularly team physicians and athletic trainers. 

6. Study of the prevention, care, and treatment of athletic injuries ranks the highest 
in regard to the amount of class time consumed, while the least time is devoted to the 
responsibilities and limitations of the trainer and coach and to the study of training 
facilities and equipment. 

7. The most widely used teaching techniques employed appear to be the lecture, 
demonstration, laboratory, and examination activities, while the least widely used ex- 
periences include group meetings and reports, field trips, visits, and observations. 

8. The most popular textbook in use at present is that of Cramer (2). 

9. The more popular items of subject matter suggested by respondents for future in- 
clusion in courses in athletic injuries are nutrition, the specifics of treatment, physical 
therapy, and review of the basic sciences. 

REFERENCES 
1, AMERICAN ASSOCIATION FOR HEALTH, PHYSICAL EDUCATION, AND REORPATION. Research 
Methods Applied to Health, Physical Education, and Recreation. Wash., D. C.: AAHPER, 
2. oun tae L., L. L. Boventon, and CHARL#s CrAmMeR. A Training Room Manual. 
Gardner, Kans.: Cramer Chemical Co., 1945. 144 pp. 
8. HerrIncton, Grorarp Squires. An analysis of courses in educational sociology with pro- 
posed changes. J. Educational Sociology 22: 259-75 (Dee, 1948). 


STaFrorD, FRANK S., and Svea T. Srrern. Institutions Offering Professional Education 
in Physical Education. Wash., D. C.: U. 8. Govt. Printing Office, 1949, pp. 1-4. 


1Copies of the tabular matter summarized herein will be supplied by the author upon request 
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Predictive Ability of a Selected Basketball Test 


CLIFFORD A. BOYD, JAMES R. McCACHREN, AND I. F. WACLOW 

University of Florida 

Gainesville, Florida 

ONE OF THE PROBLEMS confronting a coach is that of economy of time and effort 

in making an efficient selection of his squad members. With this in mind, the investi- 

gators decided to determine the predictive ability of a test battery in selecting members 

of a basketball team. A review of the literature revealed the Knox Basketball Test which 

reported a reliability coefficient of .88 as a good battery to use. This battery consisted of a 
speed dribble, wall bounce, dribble and shoot, and a “penny-cup” test. 

The test was administered to 42 candidates for the University of Florida Junior Varsity 
squad. At the end of three weeks of practice, the coach was asked to rank the players 
according to playing ability. The coach retained the top 18 on his squad and the re- 
maining 24 players were dropped. The coach’s ranking was placed in a sealed envelope 
to be opened at the end of the season. At the end of the season, the coach also ranked 
his 18 squad members as to ability without consulting any of the written records. 

In order to arrive at an equitable system of ranking players regarding playing time, 
the total number of minutes played was divided by the number of games played, thus 
yielding an average playing time per game. The players were also ranked by comparing 
total points scored with average minutes played per game. 

A bi-serial correlation of .96 was revealed when the test results were correlated with 
the categories of team membership and non-team membership. 

The five variables were correlated using the rank difference method of correlation. 
Results are given in Table 1. 


TABLE 1 


Intercorrelations of Knox Basketball Test, Coach's Rank (End of 8 Weeks and End of Season), 
and Game Factors (Minutes Played and Points Scored) 
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2. Coach (End of Three Weeks) x 27 -23 52 
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4. Average number of minutes played per game..' x Al 
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There were only two significant rank difference correlations. First, a rho of .85 was 
obtained between the coach’s ranking of the 18 squad members at the end of the season 
and the ranking of the 18 players according to average minutes played per game. Sec- 
ondly, a rho of .79 was obtained between the coach’s ranking of the 18 players at the 
end of the season and the ranking of these players as to total points scored per minute 
of playing time. This would indicate that the coach made a valid rating of his players if 
time played and the number of points made per minute of playing can be considered as 
valid criteria of playing ability. 

The Knox Basketball Test appears to have merit as a predictor of squad membership in 
basketball; that is, it can readily distinguish between varsity and non-varsity caliber 
players. On the other hand, the value of the Knox Test in distinguishing between levels 
of ability among squad members does not have any statistical significance. 
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COMMENTS 


Comments on the article by Alfred W. Hubbard and Charles N. Seng, ‘’Visual Move- 
ments of Batters,’ Research Quarterly, 25: 42-57 (March 1954). 


ALTHOUGH ONE continues to hear echoes of the attractive Baconian notion that 
sciences proceed best by avoiding all hypotheses and beginning with the facts themselves, 
there has been a growing recognition of the importance of hypotheses in any scientific 
enterprise. Hypotheses constitute the first step in any scientific undertaking. The par- 
ticular ones proposed for a problem set the stage for observation. Without them one 
would not know what to observe. And the observations serve to confirm or reject the 
hypotheses involved. 


Occasionally, however, investigators tend to forget the role of hypotheses in the 
scientific enterprise, and they behave as though their hypotheses were established facts. 


In their report on visual movements of batters, Hubbard and Seng state that: 


‘. . , batting is not primarily a reaction-time problem. The stimulus object (ball) is continu- 
— visible during its flight, not suddenly presented. Consequently the problem is one of 
tracking a moving object, predicting its course and, at some point in its flight, deciding to swing 
or not. This prediction subserves the motor response. Although generating momentum in the bat, 
and the preceding central processes, involve some duration, any sensory component in reaction 
time is essentially eliminated. In other words, estimates based on reaction time or when the 
batter must make his decision and start the process which will result in the swing place the point 
too far back in terms of ball flight.”” (p. 42) 


Their analysis presents two interesting hypotheses relative to the general batting situa- 
tion. One hypothesis is that the batter can track a pitched ball, and the other is that 
the batting response is unlike a reaction-time response because the sensory component is 
essentially eliminated. As is the case with all hypotheses, one can and should ask the 
question, Does the evidence confirm or refute? 


The main evidence investigators offer for tracking a pitched ball is presented in Table 
5, entitled, “Incidence of Head and Eye Movements by Frames from Complete Sequences 
in Reaction to Flight and Speed of Ball.” (p. 51) Inspection of these results reveals that 
no apparent tracking movements were found during practically the first half of ball flight. 
In short, there was no evidence of tracking during what may be considered the critical 
period of ball flight. The investigators, of course, are willing to concede that their 
measurement technique probably was not adequate for recording possible head or eye 
movements during the early part of flight, but this limitation cannot be said to constitute 
# guarantee that tracking did occur. 

If one considers the interval of ball flight up to the point where the central processes 
took over and led to the generation of momentum in the bat, as the investigators explain 
events, the situation is not much better. It is stated, for example, that the momentum for 
the swing was probably generated and applied to the bat during the fourth or fifth frame 
before the hit. If one then used the extremely conservative estimate of two frames (ap- 
proximately 0.08 sec.) for the preceding central processes, making an over-all estimate of 
six or seven frames, Table 5 is indeed revealing. Practically all instances of apparent 
head and eye movements occur after commencement of the central processes. Since the 
central processes presumably cannot do much in the way of predicting the ball’s course, 
deciding to swing or not to swing, and generating momentum in the bat until information 
has been received, any evidence of tracking is essentially eliminated. 
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The principal thing the investigators have to show for their efforts is that there ap- 
peared to be some apparent head and eye movements after it was too late to track the 
ball. Precisely how any conclusions relative to tracking can be reached under these 
conditions is difficult to understand. The fact that the investigators were able to even 
specify the type of eye movement batters are supposed to use in tracking can only indi- 
cate an act of supreme faith in the ability of batters to track a ball. In other words, the 
hypothesis of tracking was assumed to be a fact at the outset, and interpretations of the 
data followed the implications of this assumption. To state that such procedures seldom 
prove fruitful is a mere truism. 

Although the limitation of a note does not permit a detailed evaluation of the hypothesis 
that batting is not primarily a reaction-time problem, it is possible to comment briefly 
on one of the consequences involved. The investigators, for example, object to the reac- 
tion-time hypothesis on grounds that “estimates based on reaction time of when the 
batter must make his decision and start the process which will result in the swing place 
the point too far back in terms of ball flight.” On the basis of this assumption, one should 
expect estimates of the interval between commencement of the central processes and the 
hit to be shorter than reaction time estimates of batting. That such is probably not the 
case can be simply demonstrated. 

Using the conservative estimate of six or seven frames before the hit as the point 
where the central processes must begin to act on information previously received, and 
using a time interval of 0.04 sec. between frames, it would seem that the investigators’ 
batters must have started reacting to the ball’s flight no later than 0.24-0.28 sec. before 
the hit. In contrast to this interval, studies on batting reaction time found that starting 
reaction times in simple and choice situations required 0.21-0.29 sec. The surprisingly 
close agreement between these two estimates can hardly be said to support the hypothesis 
involved. 

It is to be emphasized that the estimate of 0.08 sec. for the central processes is un- 
doubtedly far too conservative. Even in a relatively simple response situation, the cen- 
tral processes appear to require the greatest proportion of time. For example, in his dis- 
cussion of a tracking situation involving a reaction time of approximately 0.30 sec., Craik 
(1) notes that: 

Physiological studies of the response time of the eye and the time taken for impulses to be trans- 
mitted up the optic nerve show that these can account for about 0.01 sec. only, and electrical 
stimulation of motor nerves and records of the start of the limb-movement which results show 
that this accounts only for another 0.01 sec. approximately. There is left about 0.28 sec. to be 
accounted for in the cerebral processes. 

REFERENCES 
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—Arthur T. Slater-Hammel, Indiana University. 


Rebuttal to above comments on “Visual Movements of Batters” 


SLATER-HAMMEL CONJURES up two hypotheses from a study which his introduc- 
tory statements lead one to believe had none. The actual hypotheses established for test- 
ing may be found in the “Purpose of the Study,” the preceding paragraph and various 
parts of the text (p. 43 ff.). But since he chose to set up as “hypotheses” an assumption 
and an opinion, neither of which was established for testing, we shall discuss them, 
Slater-Hammel’s first “hypothesis” concerns our assumption that “a batter can track a 
pitched ball,” or that visual perception depends on tracking the ball. Distinct vision 
requires a stationary image on the retina, So, in batting, eye movement consistent with 
ball movement (up to the limits of eye movement) would seem necessary to subserve 
perception of the speed and direction of pitched balls, One could tell pretty well 
whether the eyes were focused on the ball in frames in which both were visible, but 
evidence of tracking before that necessarily depended on eye movement consistent with 
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ball movement. Furthermore, evidence of eye movement inside 30 ft. presupposed pre- 
vious tracking. In interpreting our Table 5, the important point was the absence of evi- 
dence of tracking in the last frame or two, rather than lack of evidence in early frames 
{p. 51). 

The absence of evidence of tracking in early frames is inherent in the situation and 
integrated with the perceptual part of the batter’s problem—to differentiate speed and 
direction of pitched balls. In solving this problem preliminary movements by the 
pitcher may be misleading, but after delivery the batter’s cues come from increase in 
apparent ball size and ball movement. Differences in the rate of increase of apparent 
ball size probably provide cues to ball speed. But we were more interested in apparent 
ball movement, which can be reduced to horizontal and vertical movement—the latter 
depending on angle of projection and ball speed, which complicates computation. How- 
ever, assuming ball flight in a vertical plane, the horizontal angular movement would be 
the angle whose sine is proportionate movement divided by distance from the batter. 
That is, for a ball that would pass in a vertical plane 6 in. from the batter’s eyes, the 
horizontal angular movement from release to 30 ft. from the batter would be 3/360, or 
29’ (minutes), and for a ball passing 36 in. from him, 18/360, or 2°51’. To put it briefly: 
half of the apparent horizontal movement from release to 30 ft, occurs from 40-30 ft.; as 
much apparent movement occurs between 30 and 20 ft. as between release and 30 ft., and 
about three times as much occurs between 20 and 10 ft. as between release and 30 ft. or 
between 30 and 20 ft. 

Note the small apparent angular movement from release to 30 ft. or even to 20 ft. For 
the batter, differences in the rate of angular movement and increase in ball size during 
the first half of ball flight provide a poor basis for diflerentiating ball direction and 
speed, but inside 30 ft. the bases become exponentially better. For the experimenters, 
such slight corresponding eye movement during the first 30 or 40 ft. of ball travel spread 
over 5 to 12 frames was not measurable. Microscopic analysis of enlarged prints might 
have shown evidence of movement. But since evidence of tracking inside 30 or 20 ft. pre- 
supposed previous tracking and since one of the primary problems was to determine 
whether the ball was tracked to contact, the additional labor and expense seemed un- 
warranted. Failure to find evidence of tracking preceding that in Table 5 was, as we 
explained, an artifact of the method and situation. It should not be interpreted as evi- 
dence that tracking was not occurring. If such interpretation was unfruitful, so be it; it 
was consistent, reasonable, and necessary. 

We disagree emphatically with the conclusion that, “In short, there was no evidence of 
tracking during what may be considered the critical period of ball flight.” Slater-Hammel 
arrives at this by discarding all evidence of tracking after some hypothetical point at 
which the central process must “start.” The central processes would seem to be going 
before, during and after the swing, rather than being forced to start at some prearranged 
signal. A temporal limit must exist before which the batter must make his choice to 
start the swing—or accept the consequences. But after starting the bat he may still elect 
not to complete the swing. This action occurs at about the time the ball reaches the 
plate, suggesting that he can use cues collected as or after the swing starts. If he can 
use these late cues to check his swing, he can presumably also use these late cues to 
attempt to direct the bat to contact; but he must collect them before the angular move- 
ment of the ball exceeds the limits of eye movements and before much momentum is 
developed in the bat. Analysis of the rate of change of apparent ball size and move- 
ment suggests strongly that the first 30 or 40 ft. provide cues much less easily differen- 
tiated than the last 30 or 20 ft. (except for the last 5 or 10 ft.). Table 5 shows evidence 
of tracking in this latter period so we see no need to write off the evidence on the basis 
of invalid assumptions. 

Now concerning Slater-Hammel’s second “hypothesis”: “that the batting response is 
unlike the reaction-time responses because the sensory component is essentially elimi- 
nated.” Batting and batting reaction-time are not analogous because in batting one must 
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respond differentially to a serial stimulus situation as it develops—with certain limiting 
factors. In batting reaction-time the subject responded tc a suddenly presented pre- 
arranged signal by starting the bat along a track. Sports situations which force the 
athlete to respond to a suddenly presented stimulus for which he is set are analogous 
to the reaction-time situation. But situations in which he can see the elements develop 
and to which he must respond differentially as best he can seem to be perceptual (and 
motor) problems. A more serious objection to the batting reaction-time approach arose 
from an analysis of that study. 

Let us assume that batting can be studied as a reaction-time problem since, “In bat- 
ting, a player must react to a ball in flight, and it would seem that a batting reaction- 
time factor must enter.”! This logic seems highly questionable since it implies that every 
time a person reacts a reaction-time must enter. However, we will accept a mean of .21 
sec. for “starting reaction-time” and of .27 sec. for “movement reaction-time.” Then, 
“, .. it requires some time to bring the bat forward after starting the movement, and 
this would have to be added to the reaction time” (p. 356). The interval for moving 
the bat forward seems to be (for 86 of 96 cases) about .12 to .20 sec., or on the average 
about .16 sec. By addition we get, using mean figures, either .37 or .43 sec., or using .12 
and .20 sec., between .33 and .49. According to the rules, these estimates of the interval 
to “react and move” should have been used in going from reaction-time to the batting 
situation. 

To find the point at which the batter would have to get his signal to start to react, 
we multiply ball flight (60 ft. roughly) by the fraction, interval to “react and move” 
divided by interval of ball flight at the speeds of pitched balls. A ball travels 60 ft. at 
50 mph. in .82 sec., at 70 mph. in .58 sec., at 95 mph. in .43 sec., and at 100 mph, in .41 
sec. Using mean figures for “starting reaction-time” plus “movement-time,” we get for 
50 mph. 32 ft., for 70 mph. 45 ft., for 95 mph. 52 ft. and for 100 mph. 54 ft. Using mean 
figures for “movement reaction-time” plus “movement-time,” we get corresponding esti- 
mates of 32, 45, 60, and 63 ft. Using .20 sec. “movement-time,” the corresponding dis- 
tances for “starting reaction-time” would be 30, 42, 57, and 60 ft., and for “movement 
reaction-time” 34, 49, 65, and 69 ft. Note that for fast balls or for something like a 
“movement reaction-time” the batter may have to get his signal as or before the pitcher 
releases the ball. 

Following the specified rules for determining when the signal must occur in assuming 
batting is a reaction-time situation, we find: first, that only for very slow balls could the 
signal occur inside the last half of ball flight; and second, that the batter must be pre- 
pared to react and swing on the basis of some signal which may occur before the pitcher 
releases the ball. Since he cannot know until release by the pitcher what the situation 
will be and since the first half of ball flight provides very poor cues, how can he differ- 
entiate between even a fast and slow ball, let alone estimate its direction? We found 
(in Table 2) that the batters did respond differently to balls at different speeds. They 
responded differentially to balls arriving in different positions—by hitting them. Conse- 
quently, our opinion that batting is a perceptual (and motor) problem rather than some- 
thing analogous to reaction-time seemed valid. 

Only by disregarding movement-time can one get the estimates of “22 ft. to 30 ft. 
from home base” for a “starting reaction-time” to balls 70-95 mph., or “28 ft. to 38 ft.” 
for a “movement reaction-time” (p. 355). The proponent of “. . . surprisingly close 
agreement .. .” chooses to perpetuate the error of disregarding movement-time. For us, 
“batting reaction-time” remains a primarily academic problem with little or no demon- 
strable analogy to actual batting. In other words, there must be a better approach to the 
problem of batting than one which disregards the facts of batting and appears to provide 
no feasible basis for differentiating ball speed and direction —Alfred W. Hubbard, Uni- 
versity of Illinois. 


1A. T. Slater-Hammel and R. L. Stumpner, Batting reaction time. Research Quarterly, 21: 
858 (Dec. 1950). 
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Health 


45. Nationa Dairy Councit. Adolescent nutrition. Dairy Council Digests, 26: 4 
(Mar. 1955). 

The recommended calorie allowances for boys and girls are differentiated at age ten 
and subsequently through adolescent and mature years. At ten, at least in the United 
States, boys are more active and the same activity seems to cost a boy more energy than 
it costs a girl of the same size. 

Special attention needs to be given to providing the adolescent with enough calories, 
protein, calcium, vitamin D, and iodine. Generally, if foods are included which supply 
these nutrients, other nutrient needs will also be met. 

The energy and nutrient needs of adolescents are greater than for other periods of 
life with the exception of pregnancy and lactation in women.—Paul Hunsicker. 


46. E1cuentaus, Joun E. Cataract. Today's Health, 33: 32-33 (Mar. 1955). 

Cataract is but a fogging of the lens in the eye. Difficulty with vision, especially for 
close work, and a faint grayness of the pupils are usually the first symptoms noticed. If 
ihe eyes are left alone, this grayness of the pupil will ultimately turn a dead white, the 
lusterless white of a ripe cataract. Except for this whiteness, there is no sign of trouble— 
no inflammation, no growth of tissues, no irritation of the lids. Many cataracts start in 
the middle part of the lens because this part receives the least nourishment. Some 
babies are born with cataracts, the lens never having developed properly. Diabetes and 
glaucoma cause changes in the chemistry or circulation of the eye fluid and speed the 
formation of cataracts. There is only one cure—the clouded lens must be removed. 
Such operations under local anesthesia have proven amazingly successful under compe- 
tent eye surgeons.—J. Grove Wolf. 


47. Ercnentaus, Joun E. Sciatica. Today's Health, 33: 23: 40-43 (Apr. 1955). 

Sciatica doesn’t kill, doesn’t take away your ability to move, doesn’t make you weak, 
but it does disable you. Most medical authorities agree that the cause of sciatica is usu- 
ally rupture of an intervertebral disc. There is a difference of opinion even among top 
authorities on the proper way to handle a ruptured disc. There will be some disability 
after an operation, so one should not get out the knife too soon. Probably the best single 
test to show if a ruptured intervertebral disc needs an operation is the straight-leg-raising 
test. One noted authority will not operate on anyone who can lie on his back and raise 
the affected leg more than halfway off the table without bending the knee. One may 
help prevent sciatica by (1) avoiding lingering mild infections; (2) lifting properly 
without bending the back—lift with legs, not spine; (3) keeping posture normal; and 
(4) not greeting every backache with a softer and softer mattress.—J. Grove Wolf. 


48, Eppricut, E. S., and Caartotte Roperucx. Diet and nutritional status of Iowa 
school children. Amer. J. of Public Health. 45: 4: 464-471 (Apr. 1955). 

The food intake and nutritional status of children were studied in Iowa. There were 
61 schools considered with a sampling of 1,200 students. The core of diets of Iowa chil- 
dren, like their elders, consisted of liberal amounts of meats, potatoes, sweet desserts, 
table fat, bread, and cereal. The milk consumption was two to three cups daily. Diets 
were slightly low in vitamins C and A. The most significant problems were dental caries 
and weight control for the teen-age girls. Iowa children need only slight improvements 
in their diets to obtain good nutrition —Joseph Lantagne. 
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49. Jouns, Epwarp B. Field experiences in health education for prospective teachers. 
J. of School Health, 25: 5: 123-131 (May 1955). 

The preparation of teachers in health education is one of the major objectives facing 
school health education today. In the preparation of health educators are these persons 
obtaining the kinds of field experiences necessary to meet the challenges they are about 
to face? Dr. Johns summarizes by stating “First, the need for purposeful experiences in 
the preparation for teachers in health education; secondly, the meaning of laboratory 
experiences, including field experiences, and desirable outcomes; thirdly, practices and 
current problems relating to laboratory experiences in the preparation of school health 
educators.—Joseph Lantagne. 


50. Sax, Jonas E. Present status of the problem of vaccination poliomyelitis. Amer. 
J. of Public Health, 45: 3: 285-297 (Mar. 1955). 

There are many who hold that an effective means of preventing paralytic poliomyelitis 
will come from a live-virus vaccine. This live-virus vaccine should be more desirable 
even if a non-living vaccine is found to be effective. Life-long immunity seems to be 
associated with recovery from the naturally acquired infection. Most scientists agree 
that it isn’t possible to reproduce the degree nor the quality of immunity from non- 
living virus that is attainable from living virus. Results from experiments indicate 
that, even with a non-living vaccine, by proper spacing and dosage, it will produce a 
booster effect. From these data either a living or non-living vaccine should produce 
desirable results.—Joseph Lantagne. 

51. Tobey, James A. Backache. Today's Health, 33: 42-43 (Feb. 1955). 

Over 57 possible causes of backache have been listed by various authorities including 
poor posture, sudden strains, arthritic conditions, and many others. Pains in the back 
are almost invariable accompaniments of many of the acute communicable diseases such 
as influenza, pneumonia, and other acute respiratory disorders. The numerous physical 
causes of backache are estimated to account for only a little more than half the poor 
aching backs with which the American people are afflicted. The remainder are not due 
to physical causes, but arise from deep-seated nervous and emotional disorders, It is 
much better to prevent the affliction of aching backs than to effectively treat this condi- 
tion. This can generally be accomplished by the maintenance of general good health; 
by proper posture; by avoiding overweight; by preventing strains and accidents; by 
dodging contagions; and by cultivation of good mental hygiene.—J. Grove Wolf. 

52. Wirry, Pau, Comics, television and our children. Today’s Health, 33: 18-21 (Feb. 
1955). 

Over 90 per cent of boys and girls between eight and 13 years of age read comic books 
regularly. Fourth-, fifth-, and sixth-graders appear to be the most avid readers. The 
attraction of comic magazines continues throughout the middle grades, and many high 
school students read comics. 

Children spend nearly 20 hours a week watching television during the year, with the 
high school student spending somewhat less time, averaging between 14 to 17 hours a 
week. No significant differences were found between the IQ’s of pupils and the hours 
devoted to TV. Excessive viewing of television, however, seemed to be associated with 
somewhat lower academic attainment. The teachers concluded that the desirability or 
undesirability of the amount of televiewing could be determined only by a complete 
study of each child. Parents and children should together plan balanced programs of 
recreation not only with time for children to see TV and read comic books, but also with 
time for them to engage in other activities of greater value.—J. Grove Wolf. 


Physiology 
53. Branx, Lane B. Effects of warm-up on speed. Athletic J. 35: 10: 45-46 (Feb. 
1955). 
A group of 16 experienced track and field athletes were divided at random into two 
groups and timed daily in the 120-yard dash for 22 consecutive days. One group par- 
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ticipated in time trials following an optimum warm-up pattern, while the other group 
underwent similar recorded runs preceded by a minimum pattern of activitiy. The warm- 
up routines were interchanged on alternate days. In a second experiment, 48 individuals 
with no prior track and field experience volunteered as subjects, but only 38 completed 
the research period. Again the alternating pattern was employed, but the distance run 
was reduced to 100 yards. The difference between the mean optimum warm-up and 
mean minimum warm-up for each of the two groups was found to be significant at better 
than the one-tenth of one per cent level of confidence. It was found that warm-up is of 
benefit to performance in sprinting and for the groups investigated there was no apparent 
difference between the effect of the warm-up routines upon experienced and non- 
experienced individuals.—J. Grove Wolf. 


54. Dit, D. B., J. C. Seep anp F. N. Marzutut. Energy expenditure in bicycle riding. 
J. Applied Physiol. 7: 320. (1954). 

Energy expenditure in riding a bicycle with balloon tires (24%4-in, diam.) and narrow 
tires (14%4-in. diam.) was made. The net oxygen consumption was 0.19 L/min. greater 
with balloon tires. This difference is explained by a greater friction caused by the balloon 
tire. At a speed of 15 mph the propelling force for the balloon tire is 6.5 gm/Kg. gross 
load, whereas for the narrow tire it is 2.5 gm/Kg. Bicycles were ridden on an elec- 
trically driven treadmill.—Leslie Leggett and Peter V. Karpovich. 


55. Gersten, J. W. Changes in hydration of muscle and tendon following the applica- 
tion of ultrasonic energy. Arch. Phys. Med. 36: 140-148 (Mar. 1955). 
Exposure to ultrasonic vibration increased the relative water content of 24 normal 
frog gastrocnemii, while decreasing that of 30 normal Achilles tendons. The changes 
were reversible. Since anoxia, temperature elevation and 2, 4-dinitrophenol (DNP) 
produced little change in the dehydration effects elicited by ultrasound, the author 
suggested that physical factors were of primary importance in controlling the water con- 


tent. This hypothesis was given support by the changes in the hydration occurring after 
treatment with glycerol or 0.5M KCl, in which case increased hydration was observed 
in muscle and also in tendon.—Clifford E. Keeney and Peter V. Karpovich. 


56. HeLiepranpt, Frances A. Physiology of motor learning as applied to the treatment 
of the cerebral palsied. Qaurt. Rev. Pediat. 7: 5-14. (1952). 

The degree of neuromuscular training affects all phases of the total rehabilitation pro- 
gram. The basic motor patterns which are not under volitional control, as in the 
cerebral palsied individual, can be controlled by training. 

Attainment of complex motor acts by utilizing individual muscles or isolated muscle 
groups may be unphysiologic. Motor skills appear to develop out of total patterns of 
response. As overloading is important for the development of power so excessive 
stimulation may be just as important in acquiring motor skills. Patient should be 
positioned to permit functioning of normal developmental patterns, rather than placed 
in a position providing for only isolated muscular movements.—Elena M. Sliepcevich. 

57. Henry, F. M. Aerobic oxygen consumption and alactic debt in muscular work. 
J. Applied Physiol., 3: 427-438 (Jan. 1951). 

Oxygen intake during exercise and recovery was measured continuously while 12 sub- 
jects worked on a bicycle ergometer. The load ranged from 15 to 910 kg. m/min at a 
speed of 62 r.p.m. The steady state oxygen intake a was linearly proportional to meta- 
bolic work load, and there were no individual differences in intake rate. The intake rose 
to the steady-state value in accord with an exponential law, the velocity constant k of 
this rise being correlated r = 0.79 with the alactic recovery constant k’, and exhibiting 
pronounced individual differences. 

In another experiment, 35 subjects worked at a constant load of 614 kg. m/min. It was 
found that steady-state intake a and the recovery exponent k’ were uncorrelated, Ath- 
letic training did not influence a but caused a significant increase in k’. All of the re- 
sults were explained theoretically by an exponential mathematical model which states 
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that oxidizable substrate is produced linearly in proportion to the work load, and is con- 
tinuously oxidized, so that the steady state represents a balance between substrate produc- 
tion and oxidation. The initial lag of intake at the beginning of exercise is not due to 
slow adjustment of the cardiovascular system, but is a necessary consequence of the fact 
that substrate cannot be oxidized until it has first been produced by the muscular work.— 
F. Henry. 


58. Kaptan, Harry A., and Jerrerson Browper. Observations on the clinical and brain 
wave patterns in professional boxers. J. of Am. Med. Assn., 156: 1138-1144 (Nov. 
20, 1954). 

The authors have conducted a study, over a period of four years, of 1,043 professional 
fighters with particular reference to responses resulting from contact trauma to the head 
inflicted in the boxing arena. One or more electroencephalograms was made for each 
subject, the total number of such examinations for the entire study being 1,400. The 
oludy was carried out in three parts: clinical observations at ringside, cinematography, 
and electroencephalography. 

Observations at the ringside and more detailed examination in the dressing room after 
the fight failed to reveal any abnormal neurological features, even in those contenders 
who lost their bouts by a knockout. A critical review of slow-motion photography of seg- 
ments of fights confirmed one major impression gained at the ringside, namely, that most 
blows to the head were short of their mark or deflected by the opponent. Electroenceph- 
alographic findings were classified according to 12 types of patterns. The greatest 
percentage of fighters was found in the slightly slow anterior type of pattern, consisting 
mainly of moderate voltage, 5 to 7 per sec. frequency, but with a fair amount of low 
voltage beta and alpha activity intermixed and superimposed in the frontal region and 
constant well-formed 10 per sec. alpha posteriorly. The physical features and perform- 
ance data of each fighter were then tabulated for correlation with the electroencephalo- 
gram. Specifically, the age of the contestant, the number of fights, the rating of the 
fighter in relation to the title holder, the type of fighter, the weight division, the number 
of wins or losses, and the number of losses by technical or 10 sec. knockouts were the 
items used in the correlations. Results failed to reveal any significant statistical findings, 
except in the rating class, in which statistical results indicated that those lower in ring 
rating had the greater percentage of disorganized electroencephalograms. The top rated 
fighters were found most prominently in the low voltage and fast types. 

From one to six repeat electroencephalographic examinations were carried out on 197 
fighters. While changes occasionally were noted in the records of some subjects, there 
was no uniform pattern of change with continuity, sytle, or frequency of fighting. Many 
fighters after one, two, and three years of active boxing had no change whatsoever in their 
electroencephalographic tracing. Even the records taken on 40 fighters within 10 minutes 
of the cessation of a bout showed no change over their prefight record. The authors 
believe that the electroencephalographic pattern of professional fighters, as well as mem- 
bers of the general populace, may be more inherent in the person than secondary to 
occupational pursuit.—Digest of Neurology and Psychiatry (Dec. 1954). 


59. Koepke, G. H., A. J. Murpny, J. W. Rat, Jr., and D. G. Dickinson, An electro- 
myographie study of some muscles used in respiration, Arch. Physical Med., 36: 
217-222 (Apr. 1955). 

Respiratory movements of the chest were recorded by means of a pneumograph at- 
tached to a pressure transducer. The action potentials were picked up by needles and 
recorded by a single channel electromyograph. Forty-five men were used as subjects. 

During a quiet inspiration only the first and the second intercostal muscles were active. 
During a deep inspiration intercostal muscles up to and including the seventh were used. 
It was difficult to differentiate between the internal and external intercostals. The ab- 
dominals were more active during expiration than inspiration. During a maximum ex- 
piration, obliques were used more than the rectus.—Peter V. Karpovich. 
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60. Zanker, H. T., R. A. Suiptey and R. E. Criark. Effect of posture and local pres- 
sure on venous circulation time of the leg. Arch. Physical Med., 36: 226-231 
(Apr. 1955). 

Circulation time was determined by injecting radio-iodine (1131) into the dorsal vein 
of the foot and detecting the appearance of I'31 in the skin over the groin by many a 
scintillation counter. 

The average time for 24 patients was 28 seconds, with a range of from 8 to 90 sec. 
Application of pressure to popliteal region reduced the circulation time to 14 sec. with 
a range of from 3 to 50 sec. Pressure of 10 to 20 mm applied to calf reduced the circu- 
lation time to 21 sec. Flexion of the knee and assumption of a sitting position produced 
no consistent effect. Time was increased on standing position. Elevation of legs de- 
creased the circulatory time. In patients with arteriosclerosis, obliterance, elevation of 
Jegs caused an increase in circulation time.—Peter V. Karpovich. 


Psychology 
61. Bernperc, Raymonp E. A measure of social conformity. J. of Psych., 39: 89-96 
(Jan. 1955). 

Two forms of an attitude scale to measure social conformity were constructed, using 
items classified in such categories as moral values, positive goals, reality testing, ability 
to give affection, tension level, and impulsivity. The direction of perception technique 
of altitude measurement was used. These scales were initially given to two groups of 
boys: 100 high school seniors of low socio-economic level and low IQ, and 125 boys from 
a broad range of socio-economic level, average IQ of 114 on Otis short form. 

A scale (the HRI) was revised and to secure evidence of discrimination between 
groups was given to: (a) 160 male inmates of a California youth prison, (b) 101 Police 
Science majors. Significant mean differences between conforming and non-conforming 
groups were found by both parametric and non-parametric tests of significance. The 
scale was further checked to determine the influence of intelligence, socio-economic 
status, cultural background (sex, age within limits, and religion) on the responses. These 
factors were found insignificant in their influence on the responses, 

The authors recommend this scale as a research tool, as a tool for screening where 
social conformity reflects desirable traits, and as a demonstration tool of socio-psyc ho- 
logical concept and indirect attitude measurement method.—Frances Z. Cumbee. 


62. Fernrucn, Atrrep. Improving the employability and attitudes of the “difficult-to- 
place” persons. Psycho. Monographs, No. 392. 69: 7 (1955). 

The effect of a program which integrated vocational counseling, casework, and a shel- 
tered workshop on employability was studied. Fifty-two adults in Montreal who had not 
been able to find and keep employment participated in the integrated program. Data on 
such items as time employed, attitudes toward work and self, and characteristics of the 
hard-to-place was collected prior to participating in the program and again one year 
after the program. 

A comparison of the data before and after the program indicated the following results: 
(a) a significant increase in both employability and stability of employment following the 
program; (b) 7 of the 16 attitudes checked showed significant correlation with employ- 
ability and that each of the 7 attitudes showed significant improvement during the pro- 
gram; and (c) 7 of the characteristics (age, dependents, able to get along well with 
people, disabilities, length of former employment, length of time in the country, and time 
getting financial assistance) differentiated significantly between high and low employ- 
ability after the program. None of the attitudes seem to so differentiate.—Frances Z. 
Cumbee. 


Henkin, Rosert I. A factorial study of the components of music. J. of Psych. 39: 
161-181 (Jan. 1955). 

The principal axis method of factoring was used on the response of students to rec- 

ords in an effort to isolate musical factors in esthetic appreciation. Ten records, chosen 
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from three periods (classic or pre-classic, romantic, and modern) to meet specifications 
under such categories as rhythm, melody, harmony, and color were used. The experiment 
was repeated twice with the same group of music appreciation students at UCLA; 243 
taking the test the first time, and 193 taking the test the second time. 
Six factors were obtained from the analysis, two of which the author felt justified in 
naming: (1) a melodic factor, and (2) a rhythmic factor. Some evidence was presented 
that the third factor might be an Orchestral Color Factor. The fourth factor was left 
unidentified and the author felt that the fifth and sixth factors were residual factors 
that could be used for information purposes in further studies. A study of factors not P 
clearly defined was recommended in the hope that a clear definition of musical factors 
might be ascertained.—Frances Z. Cumbee. 


64. Horst, Paut. A technique for the development of a differential prediction battery. 
Psycho. Monographs. No. 380. 68: 9 (1954). 

In a brief discussion of the differential prediction problem the need for a single classi- 
fication battery of tests, differentially weighted, that would allow the prediction of success 
in each of a wide variety of activities was pointed out. The index of differential predic- 
tion efficiency for the battery was defined as a function of the variances of the predicted 
difference scores when all possible pairs of criterion variables were used. The larger the 
variance, the greater the differential prediction efficiency. Using the predicted criterion 
scores, it was shown further that mathematically an equivalent definition was obtainable 
by maximizing the difference between the average variance and the average co-variance. 

An iterative procedure was used to develop an alternative technique that permitted pro- 
gressively selecting the predictor which combined with the previously selected set would 
yield the largest index of differential prediction. This alternative technique was developed 
because the labor involved in exactly selecting a subset of predictors that would max- 
imize the index would be prohibitive. 

A numerical example was used to demonstrate the computational procedures. Still fur- 
ther, equations were developed for determining the matrix of regression vectors and 
multiple correlation coefficients without the necessity of calculating the inverse of the 
predictor intercorrelation matrix.—Frances Z. Cumbee. 


65. Horst, Paut. A technique for the development of a multiple absolute prediction 
battery. Psycho. Monographs. No. 390. 69:: 5 (1955). 

A contrast between the multiple absolute prediction problem and the differential pre- 
diction problem was presented. It was pointed out that the multiple absolute prediction 
problem is one of selecting a subset of texts which would have the highest prediction 
efficiency for all criterion variables, regardless of how it differentiated among them. The 
sum of variances of the predicted criteria with no attention to the co-variance was used as 
the definition of the index of prediction efficiency. 

Mathematical derivation of formulas for obtaining the matrix of beta weights and mul- 
tiple correlations was presented. A numerical example is given which shows the compu- 
tational procedure in use in selecting the absolute predictors, solving regression vectors 
and multiple correlations.—Frances Z. Cumbee. 
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Guide to Authors 


IN LINE WITH the over-all goal of making Association publications yield the 
greatest value to the individual and the profession, the following is a guide 
for the preparation of manuscripts for the Research Quarterly, recognizing 
general techniques employed by research publications. 


Article Manuscripts 
Manuscripts should be sent to the Editor (AAHPER, 1201 Sixteenth Street, 


Northwest, Washington 6, D. C., who will see that each one is read by at least 
three members of the Research Quarterly Board of Associate Editors. On the 
basis of the three reviews, the Editor will advise the author as to the suita- 
bility of the paper or the desirability for revision. Papers are not judged by 
arbitrary standards but on their content of new research results in the field 
of physical education, health education, and recreation, presented with the 
greatest brevity compatible with scientific accuracy and clarity (see October 
1951 Quarterly, pp. 392-4). 

Since three members of the Board of Associate Editors review an article, 
it is requested that three clear copies of the manuscript be submitted in order 
to facilitate reviewing. A fourth copy of the article should be retained. 

Typewritten manuscript should be double-spaced on white paper of ordi- 
nary weight and standard size (814 x 11 inches). A brief abstract of the 
article, 100 words or less, should be typed double space on a separate sheet. 
See abstracts at head of Quarterly articles for style. 

The sheets of manuscript should be kept flat and fastened with clips which 
can be removed easily. The pages of the typewritten copy should be numbered 
consecutively in the upper right-hand corner. Paragraphs should be num- 
bered consecutively throughout the manuscript. 


Notes and Comments 


Notes on minor research and on apparatus, objective critical comments, 
and summaries of status suveys will be printed in the Notes and Comments 
section. Note that simple status surveys are no longer acceptable as regular 
Quarterly articles, by decision of the Research Council. Such studies will 
therefore be published in brief form (300-500 words) under Notes and 
Comments. 


Headings 

The article should be arranged so as to indicate relative values of heading 
and subheadings. 

Usually four gradations are sufficient: (a) article title, (b) first subhead 
appearing in boldface aligned left on page (underscored in manuscript with 
wavy line) (c) second subhead (if necessary) appearing in small caps aligned 
left on page, (d) third subhead, to appear in italic (underscored in manu- 
script), not centered, but run in at the beginning of the paragraph or section. 
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All headings should be typed in lower case with initial capitals, except for 
(c) above, which should be typed in capital letters. 
FOOTNOTES 

Footnotes are not to be used for references or literature citations. They are 
rather used for the purpose of acknowledgment, special explanation, supple- 
mentary information, etc. (See examples below.) 

Type footnotes (if any) on separate sheets, as many footnotes as convenient 
being written on a sheet. Footnotes should be numbered from 1 up for each 
article; a corresponding numeral appearing in the text. Asterisks should not 
be used, 

Examples of Footnotes: 

* This study was made under the direction of Dr. Arthur T. Slater-Hammel in the Re- 
search Laboratories, School of Health, Physical Education, and Recreation, Indiana Uni- 
versity, Bloomington, Indiana. 

* All measurements of the hand were recorded in centimeters and height was recorded 
in inches. The hand measurements were taken by Everett and reliability coefficients of 
above .90 were found for each measurement used in the study. 

* For their wholehearted co-operation in facilitating collection of the data, special grati- 
tude is extended to Superintendent Clarence Hines and the 1950-51 principals of the 
Adams, Condon, Edison, Francis Willare, Harris, Howard, Lincoln, River Road, and 
Whiteaker schools. 


Documentation 


CITATIONS OF LITERATURE 

Citations of literature should be segregated alphabetically by author’s last 
name at the end of each article, under the caption of “REFERENCES.” Do 
not treat them as footnotes. (See above.) 

The literature citations, listed alphabetically, should be numbered consecu- 
tively, their location in the text being indicated by corresponding numbers 
written in full size and enclosed in parentheses: for example, (1) (2, 3). If 
there are several references in the text to a citation, the specific pages may be 
indicated thus: (1, p. 117), (1, pp. 162-3). 

A uniform style should be maintained in writing citations. Do not enclose 
titles of chapters and articles in quotation marks. Italicize (underscore in 
manuscript) names of books and periodicals, bulletins, etc. (See examples 
below.) 

Uniform sequence of data shoud be observed, as follows: For a book— 
Author’s name (last name first); title of article or chapter; name of book; 
place of publication; publisher; year date. For a periodical—Author’s name 
(last name first) ; title of article or chapter; name of periodical; volume num- 
her; inclusive page numbers; year date. 

Examples of References Appearing at End of Article: 

1. American AssociaTION ror Heattu, Puysicat EpucaTIon, AND RECREATION. Sug- 
gested platforms for health education. Journal of the American Association for 
Health-Physical Education-Recreation 18:436 (Sept. 1947). 

2. American Association oF ScHoot Apministrators. Health in Schools. Revised 
edition. Washington, D. C.: the Association, a department of the National Educa 
tion Association. pp. 266-7. 
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Deaver, G. G., Exercise and Heart Disease. Research Quarterly, 26: 24-34, 1939. 

OcpEN, JEAN, AND Jess OcpeN. Small Communities in Action. New York: Harper & 
Brothers, 1946. 

Portrer, Joun Nicuotas. Physical Fitness of Junior High School Boys. Unpublished 
Master’s thesis, University of California, Berkeley, 1942, 


Tabular Matter 

Each table should have a descriptive heading and should be specifically 
referred to in the text by number, e.g., “Table 1(” etc., never as “the above 
table” or “the following table.” Number tables from 1 up for the entire manu- 
script, using Arabic numerals. Do not duplicate data by giving it in both 
tables and graphs. 

Tables should be double-spaced typewritten, like the rest of the material in 
the manuscript. They should be typed on separate sheets, as the printer will 
set them on a different machine from the one used for the text matter. If a 
table continues on a second sheet, it is not necessary to repeat the boxheads, 
since the printer will repeat from the original boxheads, when necessary. 

The word “TABLE” should be written in capital letters, as: “TABLE 1”; 
the table title should be written in lower case letters with initial capitals, and 
centered over the table. Tables should be ruled as desired, except that no rules 
will appear at the extreme right and left edges of the table. No double rules 
are to be used, unless necessary for clarity. 

Well-known statistical formulas should be omitted. Extensive tabular ma- 
terial, raw data, and appendixes should not be printed; the author can men- 
tion in a footnote that he will supply such material in mimeographed form 
on request. 

Hlustrations 

Illustrative material is of two types: pen and ink drawings, which are re- 
produced by the line engraving process; and photographs, wash drawings, 
stipple drawings (in short, anything containing shading), which are repro- 
duced by the halftone process. 

Line engravings are always treated as text figures and should be so desig- 
nated. All drawings should be made with India ink, preferably on white bris- 
tol board plate, 1 ply or 2 ply, which is sufficiently transparent to permit 
tracing if back lighting (e.g., a window pane) is used. Avoid graph paper 
for the reproduction copy, as the printing interferes with proper inking and 
the paper permits no corrections, Sometimes it is desirable to ink in the prin- 
cipal guide lines so that the curves can be more easily read. Good examples 
of graphs can be seen in The Research Quarterly for October 1953, pages 332 
and 366. 

Lettering should be plain and large enough to reproduce well when the 
drawing is reduced to the dimensions of the printed page (4% x 7 inches). 
Most figures can be advantageously drawn for a linear reduction of one-half 
or one-third. Be sure to draw the lines heavy enough so that they will not be 
overly thin after reduction. Explanatory lettering should be included within 
the chart. Typewritten lettering does not reproduce well; it is much better to 
use a LeRoy or similar lettering device. 
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Care should be taken not to waste space, as this means greater reduction 
and a less satisfactory illustration. Often it is possible to combine several 
curves in one figure and enable the reader to make comparisons. 

Halftones are treated as figures and should be so designated. Frequently, 
several halftones can be grouped to form an attractive full page, in which 
case they should be numbered consecutively, in Roman numerals. Photographs 
should be in the form of clear black-and-white prints on glossy paper. Care 
should be taken to see that they cannot be bent or folded in handling and 
paper clips should not be used. All imperfections are reproduced. 

Figures should each be numbered consecutively from I up for the entire 
manuscript. Use Roman numerals to number figures, and Arabic numerals to 
number tables. The legends for the illustrations should be typed upon a 
separate sheet placed at the end of the manuscript. Care should be taken to 
indicate plainly in the text the exact location of all illustrations and tables. 

The Association will assume complete engraving expense. 


Special Points of Style 
USE OF NUMBERS 

Use Arabic figures for all definite weights, measurements, percentages, and 
degrees of temperature (for example: 2 kgm., 1 inch, 20.5 ec., 300° C.). Spell 
out all indefinite and approximate periods of time (for example: over one 
hundred years ago, about two-and-one-half hours). For numerals used in a 
general sense, spell out numbers through ten and use Arabic figures for 11 
and over (seven times, five years old, 11 students). 


ABBREVIATIONS AND SYMBOLS 

Standard abbreviations should be used whenever the weights and measure- 
ments are used with figures, i.e., 10 kg., 6.25 cc., etc. The forms to be used 
(for both singular and plural) are: ft., ft.-lb., ft/sec, in., yd., min., hr., sq. ft.. 
sq. in., rpm. Gram should be spelled out in all cases to avoid possible con- 
fusion with grain; also spell out mile. All obscure and ambiguous abbrevia- 
tions should be avoided. Symbols used should follow the notation listed in 
Research Methods (AAHPER), pp. 518-20 and 522-25. Most common are: 


M = mean r = Pearson correlation 
Mdn = median "bis = biserial correlation 

N = number of individuals "1 = reliability coefficient 

n = number of measurements x? = chi square 

o = standard deviation F = variance ratio 

Om = standard error of mean t = Student (Fisher) t ratio 


Per cent should be two words. Use per cent sign (%) in tables or when it 
appears in parentheses in text. 


Proofreading 

The author will receive his original manuscript and any engraver’s proofs 
with the galley proofs of his article for correction. A reprint order blank will 
be enclosed for the author’s convenience. 


Corrected proofs and original manuscripts are to be returned 
within 48 hours by first-class mail to the Editor, AAHPER, 1201 
Sixteenth Street, Northwest, Washington 6, D. C. 
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